S271 Discovering Physics


S271 Unit 12 THE OPEN UNIVERSITY
Science: A Second Level Course


12 Special relativity 




Prepared by Bob Lambourne for the Course Team 


Contents

Table A
Study guide
Introduction

The constancy of the speed of light
  What is meant by ‘the speed of light’?
  Is c the speed relative to the source?
  Is c the speed relative to space?
  Is c the speed relative to the detector?

The principle of relativity
  Introducing the language of relativity
  Events, coordinates, clocks, frames of reference, and observers
  Inertial frames of reference
  Using a frame of reference
  The principle of relativity
  Summary of Section 3

Coordinate transformations
  A simple example of a coordinate transformation
  The central problem of relativity
  The Galilean coordinate transformation
  Summary of Section 4

Special relativity
  Einstein’s two postulates
  The Lorentz transformation
  Special relativity at low speeds
  The nature of time in special relativity
  The velocity transformation of special relativity
  Lorentz contraction
  Time dilation
  Experimental support for special relativity
  Summary of Section 5

Relativistic physics
  The scope of relativistic physics
  Relativistic mechanics
    Relativistic momentum
    Relativistic force
    Relativistic energy
  Relativistic energy conservation
  Summary of Section 6

Conclusion
Objectives
ITQ answers and comments
SAQ answers and comments
Fold-out page: standard configuration and the Lorentz transformation


S271 Course Team

Course Team Chairman: Steve Swithenby
General Editor: Stuart Freake

Gerry Bearman (Editor)
Nick Brenton (BBC-OUP)
Alan Cooper (Author)
Robin Cooper (Course Coordinator)
Alan Durrant (Author)
Baz East (Graphic Artist)
Ian Every (Academic Computing Service)
Graham Farmelo (Author)
Stuart Freake (Author)
Keith Hodgkinson (Author)
Barbara Hodgson (Course evaluation)
Tony Jolly (BBC-OUP)
Bob Lambourne (Author)
Raymond Mackintosh (Author)
Laurie Melton (Library)
Cheryl Newport (Course Coordinator)
Jim Ramage (Author)
Shelagh Ross (Author)
Eileen Scanlon (Course evaluation)
Jane Sheppard (Designer)
Milo Shott (Author)
John Stratford (BBC-OUP)
Steve Swithenby (Author)
David Tillotson (Course Editor)
Bill Toothacker (Wayne County Community College, Michigan)
John Walters (Author)


List of Units

1 Surveying the Universe: an example of physical enquiry
2 Describing motion
3 Dynamics – forces, energy and motion
4 Equilibrium, stability and gravitation
5 Mechanics revision
6 Electrostatics
7 Currents and circuits
8 Electrodynamics
9 & 10 Vibrations and waves
11 Temperature and heat: towards a microscopic explanation
12 Special relativity
13 The beginnings of modern atomic physics
14 Quantum mechanics: theory
15 Quantum mechanics: applications
16 Modern physics in dying stars

The Open University Press, Walton Hall, Milton Keynes, MK7 6AA 


First published 1982. Reprinted 1984, 1987. 
Copyright © 1982 The Open University. 


All rights reserved. No part of this work may be reproduced in any form, by mimeograph or 
any other means, without permission in writing from the publisher. 


Designed by the Graphic Design Group of the Open University. 


Filmset by Composition House Limited, Salisbury, Wiltshire. 


Printed in England by Staples Printers St Albans Limited at The Priory Press 


ISBN 0 335 16123 5 


This text forms part of an Open University course. For general availability of supporting 
material referred to in this text please write to: Open University Educational Enterprises 
Limited, 12 Cofferidge Close, Stony Stratford, Milton Keynes, MK11 1BY, Great Britain. 


Further information on Open University courses may be obtained from the Admissions 
Office, The Open University, P.O. Box 48, Walton Hall, Milton Keynes, MK7 6AB. 




Table A List of scientific terms, concepts and principles used in Unit 12.


Introduced in earlier Units, or assumed from general knowledge (GK)

Term                                  Unit
acceleration $a_x = dv_x/dt$          2
binary star                           1
centre of mass                        3
charge                                6
components of a vector                2
conservation of energy                3
conservation of momentum              3
Coulomb’s law                         6
electromagnetic radiation             8
electromagnetism                      8
elementary particle                   GK
force                                 3
interference                          9 & 10
internal energy                       [illegible]
kinetic energy                        [illegible]
laser                                 GK
light year                            [illegible]
mass                                  3
Maxwell's equations                   8
momentum                              3
monochromatic light                   9 & 10
neutron                               GK
Newton's laws                         3
orbit                                 4
photon                                GK
postulate                             GK
power                                 3
proton                                GK
quantum field theory                  8
scalar product                        3
speed                                 2
speed of light in a vacuum c          8
synchronized                          GK
thermodynamics                        11
velocity $v_x = dx/dt$                2
x-component (of a vector)             2


Introduced or developed in this Unit

Term                                  Page
absolute time                         30
Cartesian coordinate system           12
central problem of relativity         28
constancy of the speed of light       5
coordinate transformation             25
cosmic ray                            43
Einstein’s postulates                 34
ether                                 8
event                                 12
force-free particle                   18
frame of reference                    15
Galilean coordinate transformation    29
[illegible]                           31
general relativity                    34
half-life                             46
inertial frame                        17
inverse coordinate transformation     25
Lorentz contraction                   43
Lorentz transformation                35
Michelson-Morley experiment           10
muon                                  46
observer                              16
origin                                13
pi-zero meson ($\pi^0$)               53
position coordinates                  13
principle of relativity               21
relative time                         37
relativistic energy                   51
relativistic force                    49
relativistic kinetic energy           51
relativistic mass                     48
relativistic mechanics                48
relativistic momentum                 48
relativity of simultaneity            38
rest mass                             48
rest-mass energy                      52
special relativity                    34
standard configuration                27
synchronization procedure             15
time coordinate                       13
time dilation                         45
twin paradox                          46
velocity transformation               39




FRONTISPIECE 


Albert Einstein
(1879-1955)




Study guide 


This Unit consists of two components: a Main Text and an associated TV programme.
The Text is written in such a way that each Section is strongly dependent on the
preceding Sections. As a consequence, no part of the Text can be regarded as optional.
On the contrary, every Section is essential reading if you want to understand the 
Sections that follow. 


The TV programme associated with this Unit, TV12 Special relativity, deals with 
some of the experimental evidence that supports special relativity. The programme 
has been designed in such a way that it should assist you, no matter what stage you 
have reached in studying the Unit. However, you will gain the maximum benefit 
from TV12 if you have read Sections 1-5 before you view the programme. 


In working through this Unit you will find that Section 5 on special relativity is both 
the longest and the most intellectually demanding of all the Sections. For this reason 
you should try to allocate at least half of your study time to Sections 5, 6 and 7. To 
make this possible you should read Section 2 rather quickly, but you should not try 
to rush through Sections 3 and 4. 


1 Introduction


When Albert Einstein proposed the special theory of relativity in 1905, his deductions 
were based on two postulates—two fundamental principles that he assumed to be 
true. These two assumptions led Einstein to a new view of space and time, a view 
that was rapidly accepted by the scientific world. Naturally, a new attitude towards 
such basic matters as the nature of time and space resulted in changes in many 
already well-established areas of physics. There was an Einsteinian revolution, the 
magnitude of which has rarely been equalled in the history of physics. 


Although this Unit does not follow Einstein's work in detail, it does have the same 
overall approach. Apart from the introduction, the Main Text is divided into five 
parts. The first two Sections, 2 and 3, each explore one of the basic postulates of 
relativity. In Sections 4 and 5 these two principles are brought together, and their 
implications for time and space are investigated. The final Section, Section 6, investigates
the impact that special relativity had on mechanics, an area of physics
that had seemed well-understood prior to the 1905 revolution. 


Special relativity is a beautifully logical theory, and this is reflected in the ordering of 
material in this Unit. Because of this logical ordering, you should never try skipping 
parts of the Text. By proceeding methodically, you may come to appreciate that, for 
many physicists, the fascination of Einstein's theory does not come from its extraordinary
predictions, but rather from the irresistible, unrelenting, logical force of the
arguments that lead to those predictions. 


2 The constancy of the speed of light


2.1 What is meant by ‘the speed of light’?


Imagine that you are in a car travelling on a long straight motorway, and your 
speedometer shows 60 m.p.h. If you are overtaken by another car travelling 20 m.p.h. 
faster, how would you answer the question ‘What is the speed of the overtaking car?’
You would probably say that it was 80 m.p.h. Most people would understand what 
you meant, even though an hour later the overtaking car would be only 20 miles 
ahead of you. However, if you wanted to be really clear, you would say that the 
overtaking car was travelling with a speed of 80 m.p.h. relative to the road, or you 
might even say that its speed was 20 m.p.h. relative to your own car. Whichever way 
you choose to describe the speed of the other car—and the choice is yours—you must 
always have in mind some reference point or body (the road, your car, another car, 
etc.) relative to which the speed is to be measured. Normally, when talking about 
cars, we refer to speeds relative to the road. That is why the answer ‘80 m.p.h.’ is 
usually enough; the words ‘relative to the road’ are assumed, they do not need to be 
mentioned. Nevertheless, the fact that speeds are normally measured relative to the
road is simply a convention chosen for convenience, and like all conventions it can
be changed. Figure 1 shows the effect of changing the conventional point of reference 
when measuring speed. 


[Diagram: three views of the bridge, your car and the overtaking car. View 1, the road as the point of reference: bridge stationary, your car 60 m.p.h., overtaking car 80 m.p.h. View 2, your car as the point of reference: bridge 60 m.p.h., overtaking car 20 m.p.h. View 3, the overtaking car as the point of reference: bridge 80 m.p.h., your car 20 m.p.h.]


This example of relative motion teaches us a simple but important lesson: whenever 
a statement is made about speed, it must always include a convention about the reference 
point or reference body relative to which the speed is measured. This state of affairs is 
summarized by saying ‘speed is a relative quantity’. Whenever you are told the speed 
of some object, your immediate response should be to ask ‘speed relative to what?’
Now let us look at a situation in which it is vitally important to remember to ask 
this question. 
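The motorway example can be reworked as a short calculation. The sketch below assumes that all three bodies move along the same straight road and that speeds are signed, with positive meaning forwards; the function name `speed_relative_to` is introduced here purely for illustration.

```python
def speed_relative_to(speed_of_object, speed_of_reference):
    """Speed of an object relative to a chosen reference body.

    Both inputs are speeds along the same straight road, in m.p.h.,
    measured relative to the road; positive means forwards.
    """
    return speed_of_object - speed_of_reference


# Speeds relative to the road, taken from the motorway example.
road_speeds = {"bridge": 0, "your car": 60, "overtaking car": 80}

# Re-express every speed relative to each possible point of reference.
for reference, reference_speed in road_speeds.items():
    view = {name: speed_relative_to(speed, reference_speed)
            for name, speed in road_speeds.items()}
    print(f"relative to {reference}: {view}")
```

A negative value means that the body is moving backwards relative to the chosen point of reference; for example, the bridge has a speed of -80 m.p.h. relative to the overtaking car.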


As you saw in Unit 8, Maxwell’s electromagnetic theory predicts that light travels
through a vacuum at a constant speed c, given by

$$c = \frac{1}{\sqrt{\varepsilon_0 \mu_0}} = 2.998 \times 10^{8}\ \mathrm{m\,s^{-1}} \tag{1}$$

But what does Maxwell’s prediction really mean? What reference point, or reference
body, should be used when trying to interpret Maxwell’s prediction? As you will see
shortly, Maxwell’s own answer to this question was quite wrong, though his method
of calculating c remains valid. Try to formulate your own answer by considering the
following question.


Light from an explosion on the surface of a distant star is detected by a photographic
plate on a satellite orbiting the Earth. Maxwell’s theory tells us that
the speed of the light as it travels through space is c. Can you suggest three 
possible interpretations of Maxwell’s prediction? In other words, can you 
name three objects, or reference points, relative to which the light may be 
moving with speed c? 


Any light signal that is seen must have originated somewhere; it must have travelled 
away from its source and eventually have fallen on a detector (such as a photographic
plate), otherwise it would not have been observed. This simple analysis
suggests three possible answers to the question. It seems likely that c has one of the 
following three interpretations: 


1 c could be the speed of light relative to its source (in this case the distant star).
This seems a rather natural and attractive possibility.

2 c could be the speed of light relative to the space through which it propagates. This
probably seems a rather strange alternative since the vacuum of space does not appear
to provide any obvious reference points. This idea will be explained and clarified in
Section 2.3.

3 c could be the speed of light relative to the detector. This may not seem very likely
on the grounds that the light does not encounter the detector until the very end of its
journey.


Figure 1 The effect of changing the conventional point of reference when measuring speed.


Of course, there are many other possibilities; for example c might be the speed of 
light relative to the Sun, though it is hard to see why a light ray travelling from a 
distant star to an orbiting satellite should have any special relationship to an ordinary 
star like the Sun. Or c could be the speed relative to the Earth, though again, there is 
no obvious reason why it should be. The three numbered possibilities listed above 
are the simplest and most plausible answers to the question. The rest of this Section 
will be devoted to trying to decide which, if any, of the three possible interpretations is 
the correct one. 
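Whichever interpretation turns out to be correct, the numerical value of c quoted earlier in this Section follows directly from Maxwell's formula. The sketch below evaluates it; the rounded values of the two constants are assumptions made here, since they are not quoted in this Unit.

```python
import math

# Rounded textbook values (assumed here; not quoted in this Unit).
MU_0 = 4 * math.pi * 1e-7   # permeability of free space, N A^-2
EPSILON_0 = 8.854e-12       # permittivity of free space, C^2 N^-1 m^-2


def speed_of_light(epsilon_0=EPSILON_0, mu_0=MU_0):
    """Evaluate c = 1 / sqrt(epsilon_0 * mu_0)."""
    return 1.0 / math.sqrt(epsilon_0 * mu_0)


c = speed_of_light()
print(f"c = {c:.4g} m/s")
```

Notice that the calculation makes no mention of a source, a detector or a medium, which is exactly why the question ‘speed relative to what?’ has to be settled by experiment.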


2.2 Is c the speed relative to the source?


If light consisted of bullet-like particles that obeyed the laws of Newtonian mechanics, 
it would be very reasonable to interpret c as the speed of light relative to its source. 
You could imagine that the particles of light, emerging with speed c from a luminous 
object, were like bullets leaving the barrel of a gun with fixed muzzle velocity. If the 
gun were moving towards the target when fired, then the bullets themselves would 
move more rapidly towards the target than if the gun had been stationary. Similarly, 
if this ballistic picture of light is correct, light emitted from a source that is approaching
a detector should have a higher speed relative to the detector than light from a
source that is either stationary or moving away from the detector. This is a kind of 
mechanical view of light; that’s why it’s rather easy to picture. Newton would 
probably have been quite happy with this interpretation of the speed of light, but is it 
correct? We need an experimental test to help us decide. 


In principle, such an experiment is simple. All that is needed is a device that can 
measure the speed of light, and a moving source of light. Nature provides the required 
kind of source in the form of a multiple star. About a third of all the points of light 
seen as stars in the night sky are, in fact, systems consisting of two or more stars 
orbiting about their common centre of mass. Indeed, the closest stars to the Sun are 
those which form the triple system of Alpha Centauri. Binary systems, composed of 
just two stars, are the most common of the multiple star systems. Many binaries are
sufficiently close to the Sun that the two separate stellar components can be easily
resolved with a small telescope, though even those nearby binaries are a
few light years away.


Imagine a binary system in which one of the stars is very massive, while the other is 
much lighter and performs circular orbits about its massive companion, as shown in 
Figure 2. Suppose that the speed of the orbiting star (relative to its massive companion)
is $10^{[?]}\,\mathrm{m\,s^{-1}}$, and in order to avoid unnecessary complications think of the
Earth as being at rest relative to the more massive component of the binary system. 
Now consider an astronomer on Earth making observations of the orbiting star. 


[Diagram labels: massive star, path of orbiting star, positions A and B, towards the Earth, c + v, v.]


When the orbiting star is at position A, it is moving towards the Earth with speed v,
and any light that it emits should, if the bullet-like theory is correct, have speed c + v
when observed from the Earth. Similarly, half of an orbit later, when the star is at B,
it is moving away from the Earth with speed v, and the light that it emits should have
speed c − v relative to the Earth. Thus, relative to the Earth, the speed of the light
coming from the orbiting star depends on the point in the orbit at which it was
emitted.


Figure 2 In the simple bullet-like model, light from a distant star has a speed, relative to the Earth, that depends on the motion of the star at the time of emission. Thus, relative to the Earth, the speed of the light emitted when the orbiting star is at A should be greater than when the star is at B.


In practice, observations of binary stars show that the speed of the light from the 
orbiting star always has the same value, irrespective of the point in the orbit at which 
it was emitted. Such observations fail to detect any evidence that the speed of light is 
influenced by the motion of its source. A number of non-astronomical experiments, 
one of which is shown in TV12, lead to the same result. There is simply no experimental
support for the idea that the speed of light depends on the motion of its
source. 


In view of such experimental evidence, we must rule out the idea that light consists 
of particles that obey the laws of Newtonian mechanics and have speed c relative to 
their source.* 
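To get a feel for why the bullet-like model was open to astronomical test, the sketch below estimates how much later light emitted at B would arrive than light emitted at A if that model were correct. The distance and orbital speed are illustrative values assumed here, not figures taken from the text, and the model itself is the one that the observations rule out.

```python
C = 2.998e8            # speed of light, m/s
LIGHT_YEAR = 9.461e15  # metres in one light year


def ballistic_travel_time(distance, source_speed_towards_observer, c=C):
    """Travel time if light moved at c plus the speed of its source.

    This is the bullet-like (ballistic) model described in the text.
    """
    return distance / (c + source_speed_towards_observer)


# Illustrative values, assumed for this sketch only.
distance = 10 * LIGHT_YEAR  # distance of the binary from the Earth, m
v = 1.0e4                   # orbital speed of the lighter star, m/s

t_from_A = ballistic_travel_time(distance, +v)  # star approaching the Earth
t_from_B = ballistic_travel_time(distance, -v)  # star receding from the Earth
delay_hours = (t_from_B - t_from_A) / 3600
print(f"light from B would lag light from A by about {delay_hours:.1f} hours")
```

Even though v is tiny compared with c, the enormous distance turns the small difference in speed into a lag of several hours. A dependence of this size on the motion of the source is what the observations described above fail to show.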


SAQ 1 Two spacecraft, each carrying a laser, are travelling directly towards
the surface of a planet. The speed of one of the craft relative to the planet is
$2 \times 10^{[?]}\,\mathrm{m\,s^{-1}}$ while the speed of the other is $4 \times 10^{[?]}\,\mathrm{m\,s^{-1}}$. If both craft fire
their laser beams simultaneously from a range of $10^{[?]}\,\mathrm{m}$, which of the two
beams will strike the planet first, and what will be the delay before the other
beam strikes?


2.3 Is c the speed relative to space?


This Section deals with the interpretation of c that Maxwell himself would have 
given, though he would not have posed the question given in the title in terms of 
‘space’. Maxwell would have used a more technical term—the ether—a term that 
will now be explained. 


Apart from electromagnetic radiation, the waves encountered most frequently in 
physics are those of sound propagating through the air. It is well known that sound 
is unable to travel through a vacuum, a fact which is explained by saying that in a 
vacuum there is no substance—no medium—to carry the sound wave. Throughout 
the nineteenth century it was felt that electromagnetic influences also needed some 
kind of medium for their propagation. The realization that light, which certainly 
travels through a vacuum, is an electromagnetic wave, forced those who believed in 
an electromagnetic medium to grant it rather strange properties. For example, the 
medium had to fill space (otherwise the stars would not have been visible) yet 
astronomical bodies such as the planets had to move through the electromagnetic 
medium without friction, or else their orbits would have changed. This peculiar all-pervading
medium, which propagated electromagnetic effects but apparently did
little else, was called the ether. 


Asking ‘Is c the speed of light relative to space?’ does not really make sense. Space 
does not supply any natural reference points that can be used when measuring speeds. 
But it is perfectly sensible to ask ‘Is c the speed relative to the ether?’ just as it would
be sensible to enquire about the speed of sound relative to the air through which it 
moves. 


Having formulated a sensible question about the speed of light in terms of the ether, 
you can see that an obvious goal for nineteenth century physicists was to test the 
ether interpretation experimentally. In a letter written in 1879, Maxwell shows that 
he knew of at least two possible bases for such an experiment. His ideas were based 
on the assumption that the Earth’s orbital motion around the Sun ensured that the 
Earth was moving relative to the ether, and this relative motion should influence the 
speed of light relative to a detector fixed on the Earth. One idea, briefly mentioned 
by Maxwell, involves the amount of time required for a beam of light to travel from 
a point A to a mirror at a point B and then to return to point A (Figure 3). Maxwell 
realized that, according to the ether interpretation, the time required for the round 
trip from A to B and back would not only depend on the distance between A and B, 
but would also be sensitive to two other factors: 


1 the speed of the line AB relative to the ether; and 


2 the orientation of the line AB. 


* Note that we are not saying that light does not consist of particles at all, but merely that it 
does not consist of a certain kind of particle with certain, specific properties. 


Figure 3 A beam of light travels from point A to a mirror at point B and is then reflected back to A. The distance between A and B is fixed. [Diagram labels: ether, mirror.]


Maxwell knew that these latter two factors would only have a very small effect on 
the round trip time, but he also knew that if that effect could be measured it would 
provide valuable information about the ether. It would even be possible to work out 
the speed of the Earth relative to the ether. 


A simple analogy may help you to understand exactly why you should expect the 
round trip time to be influenced by the two factors given above. Imagine that you 
are on board a boat steaming across a perfectly calm lake. Also imagine that you 
are a strong and steady swimmer and that you always swim at a constant speed 
relative to the water. 


This situation is analogous to the one considered by Maxwell; the water of the lake 
represents the ether, and the motion of the boat relative to the water represents the 
motion of the point A relative to the ether. Of course, your ability to swim at a 
constant speed relative to the water is the analogue of the Maxwellian idea that light 
always travels at speed c relative to the ether. Now suppose that a second boat, called 
P, is steaming across the lake at the same speed v as your boat and on a parallel 
course, but at a constant distance of 20 m from your boat. Two of the many possible 
arrangements of the two boats are shown in Figure 4. If you decided to swim from 
your boat to boat P and back, what factors do you think would influence the amount 
of time required for the round trip? Well, obviously the speed of the boats will 
be important. This is easily seen if you imagine that P is travelling directly ahead
of your boat; the higher the common speed of the boats, the longer it would take
you to catch up with P. (Indeed, if P were moving more rapidly than you could swim, 
you would never catch up with it!) So the round trip time certainly depends on the 
speed of the boats. You would also find that the time depended on the direction of P 
from your boat. For example, if P was directly astern, as in Figure 4a, you would not 
be surprised to find that the round trip time was different from what it would have 
been if P was off to one side, as in Figure 4b. Just as the time required for your swim 
would depend on both the speed of your boat relative to the water and the direction 
of P from your boat, so the time required for a light ray to travel from A to B and back 
depends on both the speed of the line AB relative to the ether and the orientation of 
the line AB. 
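The swimming analogy can be made quantitative. The sketch below uses the standard results for motion through still water, which are not derived in the text: along the course the swimmer closes the gap at the difference of the two speeds in one direction and at their sum in the other, while across the course part of the swimming speed is used up in keeping pace with the boats. The swimming and boat speeds are assumed values; only the 20 m separation comes from the text.

```python
import math


def round_trip_along_course(separation, swim_speed, boat_speed):
    """Round trip to a boat directly ahead or astern (both boats at boat_speed)."""
    if swim_speed <= boat_speed:
        return math.inf  # the swimmer can never catch a boat that is ahead
    chasing = separation / (swim_speed - boat_speed)  # catching up with a boat
    meeting = separation / (swim_speed + boat_speed)  # swimming to meet a boat
    return chasing + meeting


def round_trip_across_course(separation, swim_speed, boat_speed):
    """Round trip to a boat directly off to one side."""
    if swim_speed <= boat_speed:
        return math.inf  # the swimmer cannot keep pace with the boats
    # The swimmer must angle forwards to keep pace with the boats, leaving
    # only this component of the swimming speed for crossing the gap.
    crossing_speed = math.sqrt(swim_speed**2 - boat_speed**2)
    return 2 * separation / crossing_speed


SEPARATION = 20.0  # m, as in the text
SWIM_SPEED = 1.0   # m/s, assumed for illustration

for boat_speed in (0.0, 0.5):  # m/s, assumed for illustration
    along = round_trip_along_course(SEPARATION, SWIM_SPEED, boat_speed)
    across = round_trip_across_course(SEPARATION, SWIM_SPEED, boat_speed)
    print(f"boats at {boat_speed} m/s: along {along:.1f} s, across {across:.1f} s")
```

With the boats at rest both trips take 40 s. With the boats moving at 0.5 m/s the trip along the course takes about 53 s and the trip across it about 46 s, so the round trip time depends on both the speed of the boats and the direction of P.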


Figure 4 Two boats travel along parallel courses at the same speed, separated by a constant distance of 20 m. Only two arrangements are shown, but many others are possible. Case (a): boat P directly astern of your boat. Case (b): boat P off to one side of your boat.


The first physicist to devise a sufficiently precise experiment to test the implications
of the ether hypothesis was an American, A. A. Michelson. Figure 5 provides a
simplified plan of Michelson's apparatus. A narrow monochromatic beam of light
was shone onto a semi-silvered mirror* at point A, and the mirror split the original
beam into two parts that travelled at right angles to each other. The half of the
original beam that had passed through the semi-silvered mirror was reflected by a
mirror at point B, at a distance L from point A. The other half of the original beam,
the half that had been reflected at A, was reflected again by a mirror at point C,
which was also at a distance L from A. After being reflected at either B or C, the two
parts of the original beam returned to A where they recombined. The recombined
beam was then observed from D.


Figure 5 The apparatus used by Michelson in his attempt to detect the effect of the ether on the time required for light to make a round trip between two points separated by a distance L. The motion of the Earth moves the entire apparatus relative to the ether. [Diagram labels: source of light, original beam, semi-silvered mirror, recombined beam, telescope, observer.]


Having studied Units 9 and 10, you should realize that when the two parts of the 
original beam recombine at A they will interfere. If the time taken for light to travel 
from A to B and back differs from the time required to go via C by a whole number 
multiple of the period of oscillation of the light wave, then the interference will be 
constructive, the two waves will reinforce one another and the point A will be brightly 
illuminated when viewed through the telescope. On the other hand, if the time 
difference is an odd number of half periods, destructive interference occurs and the 
point A will not be illuminated. So the existence of a time difference should influence 
the illumination of point A. Furthermore, rotating the whole apparatus in the plane 
of the points A, B and C should change the orientation of both AB and AC relative 
to the direction of the Earth’s motion through the ether. Such a rotation would 
therefore alter the time delay along each path and thus change the illumination 
of point A. It was this change in illumination, as a result of rotation, that Michelson 
actually tried to detect. 
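The size of the effect that Michelson was looking for can be estimated with the same reasoning as in the swimming analogy, with light taking the place of the swimmer. The sketch below is a calculation on the ether hypothesis, not a description of what is observed. The arm length, the wavelength and the speed of the apparatus through the ether (taken to be roughly the Earth's orbital speed) are illustrative values assumed here, and one arm is assumed to lie along the direction of motion.

```python
import math


def arm_round_trip_times(arm_length, ether_speed, c):
    """Round-trip times along and across the ether flow, on the ether hypothesis."""
    along = 2 * arm_length * c / (c**2 - ether_speed**2)
    across = 2 * arm_length / math.sqrt(c**2 - ether_speed**2)
    return along, across


def interference_type(time_difference, period, tolerance=0.05):
    """Classify the interference from the time difference, measured in periods."""
    fraction = (time_difference / period) % 1.0
    if min(fraction, 1.0 - fraction) < tolerance:
        return "constructive"  # close to a whole number of periods
    if abs(fraction - 0.5) < tolerance:
        return "destructive"   # close to an odd number of half periods
    return "intermediate"


# Illustrative values, assumed for this sketch only.
C = 2.998e8          # speed of light, m/s
ARM_LENGTH = 11.0    # distance L from A to each mirror, m
ETHER_SPEED = 3.0e4  # speed of the apparatus through the ether, m/s
WAVELENGTH = 5.5e-7  # wavelength of the light, m
period = WAVELENGTH / C

along, across = arm_round_trip_times(ARM_LENGTH, ETHER_SPEED, C)
shift_in_periods = (along - across) / period
print(f"time difference = {shift_in_periods:.2f} periods")
print(f"change on rotating through 90 degrees = {2 * shift_in_periods:.2f} periods")
```

Rotating the apparatus through a right angle interchanges the roles of the two arms, so the change in the time difference is twice the difference itself. With these assumed values the ether hypothesis predicts a change of a few tenths of a period, which is why the illumination at A was expected to alter as the apparatus was turned.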


Michelson first performed his experiment in 1881, and found no sign of the predicted 
time difference. His results were challenged, so he repeated the experiment in an 
improved form in 1887, with the aid of E. W. Morley. Once again there was no 
evidence of a time difference. The result of this Michelson-Morley experiment has
been described as the most famous null result in the history of physics. As far as 
Michelson himself was concerned its implication was clear, as he wrote at the time: 


‘The result of the hypothesis of a stationary ether is thus shown to be incorrect.’ 


Despite the failure of the Michelson-Morley experiment to provide any evidence of
the existence of the ether, the ether interpretation of the speed of light was not entirely 
discredited. One possible explanation of the null result—an explanation that 
Michelson himself accepted for some time—was that the Earth carries part of 
the ether around with it, just as it carries the atmosphere. If this were correct, it 
would mean that Michelson’s apparatus was at rest relative to the ether, in which 
case no time difference was to be expected. (If the two boats considered earlier 
were both at rest relative to the water, the round trip time for the swim to boat 
P and back would be independent of the direction of P.) Unfortunately for those 
who believed in the ether, this more-complicated mobile-ether theory also failed 
to find any experimental support. As more and more results accumulated, tenable 
models of the ether were forced to become increasingly complicated. The growth 
of complexity led to mounting dissatisfaction with existing theories and a willingness 
to consider quite radical alternatives. 


It was into this atmosphere that Albert Einstein introduced his special theory of 
relativity. Einstein’s first publication on the subject was his 1905 paper, ‘On the 
Electrodynamics of Moving Bodies’. It presented a new way of interpreting electromagnetism
that retained the successes of Maxwell’s theory but did away with the
superfluous speculations about the ether that Maxwell had employed. Thus the ether 
theory was finally laid to rest by a combination of experimental discovery and 
theoretical innovation. 


* A semi-silvered mirror will reflect part of the beam while allowing the rest to pass through.


SAQ 2 Imagine that you have been asked to repeat the Michelson-Morley
experiment as one of your Home Experiments (you won't be: the 1887 version
was very heavy). Suppose you have set up the experiment on your kitchen
table, observed the interference effect through the telescope and found that
rotating the table makes no difference to the pattern of illumination. Thinking
that you have thus disproved the hypothesis of a stationary ether, you are
packing your equipment away when your neighbour visits you. He reveals
that he has long believed in the existence of the ether and explains your null
result by saying that the direction of the Earth’s motion through the ether just
happens to be perpendicular to the plane of your table top experiment. Would
your neighbour’s argument explain your null result, and if so what additional
observations would convince you that the Earth was not moving through the
ether?


2.4 Is c the speed relative to the detector?


“When you have eliminated the impossible, whatever remains, however improbable, 
must be the truth.” 


Sherlock Holmes in The Sign of Four by Sir Arthur Conan Doyle. 


Three possible interpretations of the phrase ‘the speed of light’ were given in Section 
2.1. Experimental evidence has dismissed two of them, so only one remains. Is the 
third possibility correct? Is c the speed of light relative to the detector? If so, we will 
have to accept some surprising consequences, as the following imaginary experiment 
shows. 


Consider two railway trains, A and B, travelling at different speeds $V_A$ and $V_B$ respectively,
in the same direction, along two parallel tracks (Figure 6). Suppose that both
trains are heading towards the same distant source of light, and that each carries a 
scientist equipped with a device capable of measuring the speed of light coming from 
that distant source. 


[Diagram: trains A and B, with light arriving from a distant source.]


Now imagine that just at the moment when the faster train is passing the slower one, 
each scientist uses his device to measure the speed of the light from the distant source. 
If c is the speed of light relative to its detector, then both scientists must obtain the 
same result, namely c, even though the scientists themselves are moving at different 
speeds. (Neglect the fact that the measurements are made in air instead of a vacuum.) 
You should find this prediction rather surprising; it probably conflicts with your 
usual ideas about relative speed. If the two scientists had both been looking at the 
same oncoming train, instead of a light beam, and if each had measured the speed of 
the oncoming train relative to his own train, then you would expect them to get 
different results. Obviously, if c is the speed of light relative to its detector, the behaviour
of light fails to conform with everyday ideas about relative speed.


Whether you find it surprising or not, c really is the speed of light relative to its 
detector. This interpretation of c is consistent with the binary star experiment, and 
with the Michelson-Morley experiment. Moreover, no experiment has ever
provided any convincing evidence to the contrary. The fact that the constancy of the
speed of light relative to its detector conflicts with everyday experience just shows 
how limited and unreliable everyday experience is. 


The constancy of the speed of light relative to its detector is a surprising discovery 
and it has surprising consequences. However, because it is supported by experiment 
we shall adopt it as a trustworthy principle and use it throughout the rest of this 
Unit. Indeed, in Section 5.1, you will see that Einstein used a slightly refined version 
of this principle as one of the two basic postulates upon which he founded the special 
theory of relativity. 


SAQ 3 Suppose that you are on board a spacecraft travelling towards the 
Earth at a speed of 1.5 × 10⁸ m s⁻¹. You are equipped with a laser that you
are using to send signals to a detector orbiting the Earth. When your signals 
are received, it is found that their speed relative to the detector is 3 × 10⁸ m s⁻¹.
If you were to fix a device for measuring the speed of light across the front of the 
laser, so that you were measuring the speed of the laser signal relative to its 
source, what result would you find? 




Figure 6 Two trains, A and B, are 
travelling in the same direction, but at 
different speeds, along two parallel 
railway tracks. 


3 The principle of relativity


3.1 Introducing the language of relativity


‘The theory of relativity is intimately connected with the theory of space and time. 
I shall therefore begin with a brief investigation of the origin of our ideas of space and 
time, although in doing so I know that I introduce a controversial subject.’ 


(Albert Einstein*) 


The special theory of relativity provides many insights into the nature of time and 
space. Any theory that deals with such fundamental concepts will obviously have 
far-reaching consequences. It will strongly influence our whole view of physics and 
force us to think carefully about the correct way to formulate physical laws. It is 
not surprising that so profound a theory is most easily discussed with the aid of a 
number of technical terms. These terms provide relativity theory with a powerful, 
clear and incisive language that is both well-defined and flexible. This Section is 
mainly concerned with the definitions that will make that language readily accessible. 


Once you have mastered the terminology, you will be in a position to understand the 
so-called principle of relativity, introduced in Section 3.5. This principle was the 
other basic postulate that, when combined with the constancy of the speed of light, 
led Einstein to the special theory of relativity. 


The following part of the Text is radically different from the previous Section, which
was largely concerned with experimental results. In contrast, Section 3 deals mainly 
with matters of definition and deep principle. Special relativity itself, which we will 
not reach until Section 5, arises from a combination of both empirical fact and 
theoretical speculation. 


Finally, a word of advice. The theory of relativity is not really very hard, but it can 
be confusing. The key to understanding the theory is to be careful and above all to 
keep a clear mind. If at some stage your own mental picture of what is going on 
becomes blurred, go back and re-read the appropriate section immediately. 


3.2 Events, coordinates, clocks, frames of reference, and observers


The special theory of relativity involves a number of rather general statements 
about physics and the physical Universe. In order to make such statements, it is 
necessary to employ several well-defined technical terms that can be used to describe 
the Universe in a rather general way. Roughly speaking, the Universe consists of 
physical processes that take place at certain times and in certain positions. The rest 
of this Section will introduce you to the technical terms that are used to improve this 
rough outline of the Universe. 


(i) Events 


An event is an idealized version of what was called a physical process above. An 
event may be regarded as something of very short duration that occurs in a very 
small region of space. Ideally it occurs at a point in space and occupies an instant of 
time. Einstein often illustrated his discussion of relativity with a rather dramatic 
example of an event: lightning striking the ground. We shall often use a small 
explosion as an equally dramatic example. 

There are two basic questions that can be asked about any event: 

Where did it take place? 


When did it take place? 


Let us look at how these questions are answered. 


(ii) Fixing the position of an event by using a coordinate system


The simplest way to fix the position of an event without any ambiguity is to use a
Cartesian coordinate system. You have already used such a system several times, but
this is a good place to ensure that you have a firm grasp of the concept. A Cartesian
coordinate system is defined by three infinitely long, mutually perpendicular axes
that intersect at a single point called the origin. Traditionally the axes are labelled
x, y and z so that they can be distinguished, and for each axis one of the directions
away from the origin is called positive and the other negative. The most common
choice of positive and negative directions is shown in Figure 7. The position of a point
on any one of the axes is specified by naming the axis and stating the distance of the
point from the origin in metres, in either the positive or negative direction.


* Quoted from The Meaning of Relativity by A. Einstein, published by Methuen. This book
was first published in 1922.




Once the origin and orientation of a Cartesian coordinate system have been chosen, 
it is a simple matter to specify uniquely the location of any point with respect to that 
system. All that is needed is a statement of the three position coordinates of the point 
with respect to the coordinate system. For example, the point with position
coordinates x = 2 m, y = −1 m, z = 1.5 m is shown in Figure 8.


(iii) Fixing the time of an event by using synchronized clocks 


Having found a way of describing where an event occurred, we now need a method 
of describing when it happened. We need a fourth coordinate—a time coordinate. 
Time, which is measured in seconds, is simply the coordinate used to distinguish 
between the different events that occur at a single position. Fixing the time of an event 
is not quite as easy as fixing its position. 


Imagine that you are standing at some particular point in a system of coordinates
watching the events that take place right in front of you. To be able to answer
questions about the times at which particular events occurred, all you need to do is to
look at your watch and record the time of each event as it happens. Now suppose
that you suddenly see a flash of light from a distant explosion. The explosion is an
event, so you will want to record its time. But what is its time? The time at which you
see the flash is not the time of the explosion itself, but rather the time at which the
light from the explosion entered your eye. Your watch is only useful for fixing the
time of events that occur at your particular location. How can the time of the explosion
itself be fixed, as opposed to the time at which the flash reached your position?


Figure 7 A conventional Cartesian coordinate system. The numbers along
the axes represent distances from the origin measured in metres.


Figure 8 Fixing the location of a point in space by means of three position
coordinates (x = 2 m, y = −1 m, z = 1.5 m) with respect to a given
Cartesian coordinate system.


An obvious method is to enlist the help of a friend. If you had sufficient foresight, 
you could equip your friend with a watch that was synchronized with your own, and 
position him at the point where the explosion is due to occur (rather him than you!). 
Your friend could then record the time of events as they happened at his position, 
including, at least in theory, the time of the explosion. This seems to be, and indeed is, 
a reasonable way of fixing the time of events at different locations. However, we have 
lightly passed over one point of crucial importance. It was stated that the two watches 
had to be synchronized. This is obviously essential; there would be no point in you 
making a list of times of events at your position while your friend did the same thing 
for his position if you had no way of relating the two lists. 


How is it possible to make sure that two watches, of identical construction, at
different locations are synchronized? Well, there seems to be an obvious answer;
before taking up your assigned positions, you and your friend could simply meet at
some point and synchronize watches. But unfortunately this attractive solution has a
flaw. It assumes that after the watches have been synchronized in one place, the
process of moving them elsewhere will not affect their synchronization. Experiments
with very accurate atomic clocks show that this assumption is not valid. When
clocks are moved around after synchronization and then brought back together
again, their relative motion may well have caused a loss of synchronization. Provided
the relative speed of the two clocks is small compared with the speed of light, the
amount of desynchronization that occurs is very small; but even so, a better method
of synchronization must be found.


Figure 9 A method for synchronizing clocks or watches at different locations
by means of light signals. The panels of the figure read, in order:

You and your friend meet and agree that when your watch shows that the time is
t = 10 s you will turn on your torch.

You and your friend take up your assigned positions. Your friend knows that the
distance between you is 6 × 10⁸ m. So light takes 2 seconds to travel between you.

You switch on your torch as soon as your watch shows t = 10 s.

Your friend sees your light signal. He knows that the signal started out at t = 10 s
according to your watch, and he knows the signal took 2 seconds to reach him. So he
starts his watch at 12 seconds, as soon as he sees your signal.


In order to fix the time of the explosion, what you and your friend need is a syn- 
chronization procedure in which your watches do not have to be moved once they 
have been synchronized. They must be synchronized when they are already in 
position and at rest relative to one another. It is now that our conclusion about the 
speed of light (see Section 2.4) becomes useful. It can be used as the basis for just the 
kind of synchronization procedure that is required. 


The improved synchronization procedure makes use of the fact that when your 
friend measures the speed of a light signal he is bound to obtain c as the result. If 
your friend knows both your position and his own position in a system of co- 
ordinates, he can easily work out the distance between the two of you. Being in 
possession of that information, and knowing that light signals travel with speed c, 
he can calculate the time taken for a light signal to travel from you to him. So, all that 
you and your friend have to do is to agree that at some particular time according to 
your watch, you will send a pulse of light in his direction. When he receives your 
signal, he adds the transit time of the signal to the time at which you agreed to transmit 
the pulse of light, and then sets his watch accordingly. The result should be two 
synchronized watches, one at your location and the other at your friend’s location. 
An example of this synchronization procedure is shown in Figure 9. 
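The arithmetic behind the procedure can be sketched in a few lines. The function name below is a hypothetical choice made for this illustration; the numbers are those of Figure 9 (a signal sent at t = 10 s across 6 × 10⁸ m), with the rounded value c = 3 × 10⁸ m s⁻¹.

```python
# Sketch of the light-signal synchronization arithmetic.
# The function name is hypothetical; the values come from Figure 9.

C = 3.0e8  # speed of light in m/s (rounded)


def receiver_watch_setting(agreed_send_time, distance):
    """Reading the receiving watch should show at the moment the light
    signal arrives: the agreed sending time plus the transit time."""
    transit_time = distance / C
    return agreed_send_time + transit_time


# Signal sent at t = 10 s over a distance of 6 x 10^8 m.
setting = receiver_watch_setting(10.0, 6.0e8)
print(setting)  # 12.0
```

Notice that the only physical input is the speed of the signal relative to the person receiving it; nothing is assumed about how watches behave while they are being moved.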


To check that you have understood the synchronization procedure and its implica- 
tions, try the following SAQs. 


SAQ 4 A and B are two astronauts, separated by a distance of 3 × 10¹⁰ m,
who wish to synchronize the digital clocks on their respective spacecraft. 
Suppose that, by arrangement with B, A starts his clock from t = 0 and at the 
same instant transmits a light signal. If it takes B one second to adjust his 
clock to any desired reading, at what value of t should B start his clock when 
he receives the light signal from A? 


SAQ 5 Consider three astronauts who are located at the points C, D and E
along a straight line in the given order (Figure 10). Suppose that they have
synchronized their clocks using light signals, and that at t = 10 s, D sends
out two light signals, one towards C and the other towards E. One of the
signals is reflected by C and returns to D at t = 16 s, the other is received by
E at t = 15 s. What is the distance between C and E?




The synchronization procedure is very reliable. It does not involve any assumptions 
about the behaviour of moving watches or clocks, and the only physical principle 
that it assumes is the one that we have already discussed—the constancy of the speed 
of light relative to a detector. By using the synchronization procedure, you and your 
friend can easily assign a meaningful time coordinate to the explosion, even though 
you could not be present at the explosion itself with your own watch. 


Having solved the problem of fixing the time of a distant explosion, we can now turn 
to the problem that we originally set out to solve—that of assigning a time coordinate 
to any event, no matter where it occurs. What we need is not just two synchronized 
clocks, but rather an infinite number of them. If clocks were positioned at every 
point in space and all of the clocks were synchronized, then it would be a simple 
matter to fix the time of any event; all that would be required would be an observation 
of the time shown by the clock at the location of the event, at the moment that the event 
occurred. Of course, the process we have just described is not practicable, but there 
is no need to worry about that. The important point is that we now have a method 
of fixing the time of any event, at least in principle. 


(iv) Frames of reference 


We have seen that by using a system of coordinates and an infinite network of
synchronized clocks it is possible to specify uniquely the time and position of any
event. Any system of coordinates and clocks that makes it possible to do this is called
a frame of reference. Once we have a frame of reference, we can give an accurate
description of where and when events happened. Such a frame even acts as a
conventional standard of rest relative to which speeds can be measured. The coordinate
system belonging to a particular frame of reference can be thought of as a kind of
rigid scaffolding providing the reference body that is needed when describing the
speed of an object.


Figure 10 Three astronauts are all at rest relative to one another, and are
positioned along a straight line at points C, D and E.


When setting out to describe a specific physical process, we are free to use any frame 
of reference we wish. For example, suppose we want to describe the motion of a 
ball from the moment it is dropped until it hits the ground. We could use a frame like 
that shown in Figure 11, in which the ball starts at the point z = 0 and falls along the 
z-axis, so that the value of z corresponding to the position of the ball increases with 
time. On the other hand, we could equally well choose to describe the same process 
by using the different reference frame shown in Figure 12, in which the positive 
z-axis points vertically upwards and the ball falls towards z = 0. In such a frame, the 
z-coordinate of the ball decreases with time. If we wished, we could even be perverse
and choose a frame in which the description of the falling ball becomes very
complicated: for example, a frame that spins rapidly or one that moves horizontally
during the fall of the ball. Changing the frame of reference does not change the physical
process at all; it simply alters the way in which that process is described.
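The two descriptions of the falling ball can be compared numerically. The sketch below assumes free fall from rest with g = 9.8 m s⁻², neglects air resistance, and takes the ground to be at z = 0 in the second frame; the release height of 2 m and the function names are hypothetical choices made for illustration.

```python
# Sketch: one falling ball described in two frames of reference.
# Assumptions: free fall from rest, no air resistance, g = 9.8 m/s^2,
# and a hypothetical release height of 2 m.

G = 9.8  # assumed acceleration due to gravity, in m/s^2


def z_downward_frame(t):
    """Frame like that of Figure 11: ball released at z = 0,
    positive z-direction pointing downwards."""
    return 0.5 * G * t**2


def z_upward_frame(t, release_height):
    """Frame like that of Figure 12: ground assumed at z = 0,
    positive z-direction pointing upwards."""
    return release_height - 0.5 * G * t**2


release_height = 2.0  # hypothetical drop height, in metres

for t in (0.0, 0.2, 0.4, 0.6):
    down = z_downward_frame(t)
    up = z_upward_frame(t, release_height)
    # The z-values differ, but they always label the same point in space:
    # the two coordinates add up to the release height.
    print(f"t = {t:.1f} s   z (first frame) = {down:.3f} m   "
          f"z (second frame) = {up:.3f} m")
```

In the first frame z increases with time and in the second it decreases, yet the physical process is identical; only its description has changed.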


Most of relativity theory is more or less directly concerned with the consequences 
of describing the same physical processes from different frames of reference. 


(v) The observer 


One of the major players in the drama of relativity remains to be introduced—the 
observer. Often we shall use statements of the kind: ‘The observer in frame B notes
that ...’. An observer may be thought of as a kind of experimenter who is committed
to using a particular frame of reference. Different observers use different frames of 
reference, and every frame of reference has an observer associated with it. When an 
event occurs, all those observers who detect it will give it three spatial coordinates to 
fix its position and one temporal coordinate to specify the time at which it occurred. 
The set of four coordinates used by one observer to define a particular event will, in 
general, be different from the set of coordinates given to the same event by a different 
observer. For instance, the observer using the frame of Figure 11 describes the starting
position of the ball as z = 0; but the observer using the frame of Figure 12 would 
describe the same starting point by a different value of z. 


It is important to have a clear understanding of the role of the observer. He should 
not be regarded as someone fixed at one point in a frame, watching events taking 
place far away and recording the time at which the light from those events enters his 
eyes. No: when an observer detects an event, the time that he assigns to it is the time 
shown on one of the synchronized clocks in his frame—the one at the location of the 
event— when it happens. Perhaps the best way to think about the process of observing 
is to imagine that you have an infinite number of friends, one of whom is located at 
each point in space, and that they all have identical watches that are synchronized 
with your own watch. If you and all your friends used your watches to record the 
times of events at your own respective locations you would be performing the same 
task that we expect the observer to perform. 
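The difference between the time at which you see an event and the time that the observer assigns to it can be sketched as follows. The function name and the numerical values are hypothetical; the sketch assumes that the light travels in a straight line at the rounded speed c = 3 × 10⁸ m s⁻¹ and that you are at rest in the frame.

```python
# Sketch: the time an observer assigns to a distant event, recovered from
# the time at which its light is seen. Names and values are hypothetical.
import math

C = 3.0e8  # speed of light in m/s (rounded)


def event_time(time_light_seen, event_position, eye_position=(0.0, 0.0, 0.0)):
    """Time shown by the synchronized clock at the event's location when
    the event happened: the reception time minus the light travel time."""
    distance = math.dist(event_position, eye_position)
    return time_light_seen - distance / C


# Hypothetical example: light seen at the origin at t = 20 s from an event
# on the x-axis, 9 x 10^8 m away.
explosion_time = event_time(20.0, (9.0e8, 0.0, 0.0))
print(explosion_time)  # about 17 s: the light was in transit for 3 s
```

The value returned is what the clock at the location of the event would have shown, which is the time the observer records.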


SAQ 6 Suppose that you are located at the origin of a certain frame of 
reference and that at time t = 12 s, according to that frame, you see the light
from an explosion that occurred at the point x = 3 × 10⁸ m, y = 4 × 10⁸ m,
z = 0. According to the observer who uses the frame of reference in question, 
at what time did the explosion itself occur? 


3.3 Inertial frames of reference


Newton's first law of motion, as stated in Unit 3, asserts that: 


Any body remains in a state of rest or uniform motion (i.e. constant velocity) 
when no unbalanced force acts upon it. 


This law is usually interpreted as saying that acceleration is caused by ‘something’
and that ‘something’ is given the name force. However, as it stands, Newton’s first
law is actually meaningless; it suffers from the same problem that we encountered
when trying to interpret the speed of light. The law mentions a constant velocity,
yet we are not told the frame of reference in which that velocity is to be measured.
Unless we have a frame of reference in mind, we cannot measure speeds, and if we
cannot measure speeds we certainly cannot decide whether a body is moving with
constant velocity. Newton’s first law is in need of clarification.


Figure 11 One frame that can be conveniently used to describe a falling ball.


Figure 12 Another convenient frame for describing the motion of a falling ball.


The modern view of Newton’s first law is that it is not just a statement about the 
existence of forces, but rather what it really says is: 


In order to apply Newton’s second and third laws, a frame of reference must 
be used in which a body that is not acted upon by any unbalanced force moves 
with constant velocity. 


In other words, Newton’s second and third laws should only be used in those frames 
of reference in which the first law is true. The second law should not be used to analyse 
the motion of a particle in a frame fixed on a rotating turntable, for example, because
in such a rotating frame a particle that is not acted upon by any unbalanced force 
will not move with constant velocity. Those frames of reference in which Newton’s 
first law is true are given a special name; they are called inertial frames. 


An inertial frame is a frame of reference in which a body that is not acted upon
by any unbalanced force moves with constant velocity, i.e. constant speed
along a straight line.


Or, more succinctly: 


An inertial frame is a frame of reference in which Newton’s first law is true. 


Inertial frames play a central role in special relativity. Einstein’s theory is almost 
entirely concerned with observations that are made with respect to inertial frames. 
Indeed, that is why the theory is called the special theory of relativity; it only applies 
to a special class of frames—the inertial frames. 


If we can find one inertial frame, then we can find an infinite number of them! To 
see why this is the case, imagine that a particle which is not acted upon by any 
unbalanced force is moving with constant speed v along the x-axis of some particular 
inertial frame (Figure 13a). Suppose that you are travelling along the x-axis of that 
same inertial frame with constant speed u, in pursuit of the particle. Both you and 
the particle have constant velocity relative to the inertial frame, so it is clear that the 
particle will have constant velocity relative to you. Thus, if you think of yourself as
the origin of your own personal reference frame that you carry with you (Figure 13b),
you can see that the particle has constant velocity relative to your personal frame.


This shows that as long as your personal frame moves with constant velocity relative 
to an inertial frame, it too is an inertial frame. In fact, with a little thought, you should 
be able to see that every one of the infinite number of frames that have a constant 
velocity relative to an inertial frame is also an inertial frame. We can conclude that: 


Any frame that moves with constant velocity relative to an inertial frame is 
also an inertial frame. 
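The argument can be checked with a small numerical sketch. It assumes the everyday rule that the particle's position relative to you is its position minus yours, both measured in the original inertial frame; the speeds, the starting position and the function names are hypothetical.

```python
# Sketch: a force-free particle seen from a frame that moves with constant
# velocity. Assumption: relative position = particle position - your position
# (the everyday rule). All numerical values are hypothetical.


def particle_position(t, v, x0):
    """Position of the force-free particle in the original inertial frame."""
    return x0 + v * t


def your_position(t, u):
    """Your own position in the original inertial frame."""
    return u * t


def relative_velocity_estimates(v, u, x0, times):
    """Average velocity of the particle relative to you over each
    successive time interval."""
    rel = [particle_position(t, v, x0) - your_position(t, u) for t in times]
    return [(rel[i + 1] - rel[i]) / (times[i + 1] - times[i])
            for i in range(len(times) - 1)]


estimates = relative_velocity_estimates(v=8.0, u=3.0, x0=5.0,
                                        times=[0.0, 1.0, 2.0, 3.0, 4.0])
print(estimates)  # every interval gives the same value
```

Because each interval gives the same result, the particle has constant velocity in your personal frame too, which is the condition for that frame to be inertial.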


So, if we can find one inertial frame we can find an infinite number of them. But can 
we find one? Well, the answer is probably no: an inertial frame is an idealization. 
However, the non-existence of an ideal inertial frame is not really a problem because 
there are plenty of frames that are very nearly inertial (i.e. there are plenty of frames 
in which Newton’s first law is very nearly correct). For instance, a frame fixed on the 
surface of the Earth cannot be inertial because of the rotation of the Earth, but it 
requires a fairly accurate experiment to show that Newton’s first law is not valid in 
such a frame. If that was not the case, Newton would not have proposed his first law 
in the way that he did. The availability of approximately inertial frames means that 
special relativity can be tested just as well as Newtonian mechanics. 


Before leaving the subject of inertial frames there is one point that needs clarification. 
It was stated above that a rotating frame cannot be an inertial frame, but what 
exactly does the term ‘rotating’ mean in this context? Surely, when we say that a 
frame is rotating we really mean that it is rotating relative to something else. What is 
that something else? Strange as it may seem the answer appears to be ‘the rest of the 
Universe’. There seems to be a link between the extent to which a frame is inertial, 
and the angular speed with which it rotates relative to the distant stars and galaxies. 




A frame in which the distant stars and galaxies do not on average appear to rotate
is a better approximation to an inertial frame than one in which they do. This peculiar
relationship between the accuracy of Newton’s first law and the apparent motion of
the large-scale distribution of matter in the Universe is not properly understood. It
was studied in detail by Ernst Mach in the 1870s, and Einstein hoped that the general
theory of relativity would explain it. However, it continues to be one of the
fundamental puzzles of physics.


3.4 Using a frame of reference


A lot of time has already been spent in this Unit discussing the concept of a frame 
of reference. This Section is intended to show you why frames of reference deserve 
such emphasis. You are about to see that frames play a vital role in the process of 
discovering and writing down physical laws. 


Forget about the motion of the Earth and its gravitational influence, and pretend 
that the room you are sitting in is an inertial frame. Let one of the corners of the room 
be the origin of a coordinate system with axes labelled x, y, and z, and imagine that 
the room is full of clocks that have been synchronized with the clock on the wall of 
your room (Figure 14). These coordinates and clocks constitute a reference frame, 
and you are going to be the observer who uses that frame to carry out some physics 
experiments. 


Pretend that you don't know any physics, but you would like to discover some. The 
behaviour of a particle that is not acted upon by any forces—a force-free particle— 
would make a good starting point for your investigations. Suppose that you observe 
such a particle moving along the x-axis of your frame, and as it passes through 
various points you record both the position x of the particle, and the time t at which 
the particle passes through the point with position coordinate x. Of course, since 
you are making observations, the time t at which the particle passes through point x 
is the time shown by the synchronized clock located at the point x. 


Figure 13 Viewing the motion of a particle that is not acted upon by an
unbalanced force from two different inertial frames.
(a) The view from the original inertial frame. Both you and the particle move
along the x-axis with constant speed.
(b) The view from your frame. The particle moves with a constant velocity,
and so does the original inertial frame.


A graph of your observations would look like Figure 15. The slope of the graph 
would depend on the speed of the force-free particle relative to your frame, but the 
line would always be straight. When you had performed the experiment several 
times, always finding that the gradient of the graph was a constant, you would 
probably decide that this result deserved to be recognized as a law of physics. So you 
might declare: 


The first law of mechanics in my frame of reference is that the gradient of the 
position-time graph for a force-free particle is a constant. 


You would have just discovered Newton’s first law. If you wanted to express your 
discovery mathematically, you would naturally use the calculus notation for the 
gradient of a graph and write: 


For a force-free particle


dx/dt = constant    (2)


Of course, dx/dt means the x-component of the velocity of the particle, so you must 
be sure that you know the object relative to which the velocity is being measured. 
But in this case that is no problem; the velocity is being measured relative to your 
frame—relative to the corner of your room if you like. Actually, the notation dx/dt 
makes this clear; the x is the x of your frame of reference, and the t is the t of your 
frame of reference. Any law which you write down will naturally be expressed in 
terms of the coordinates (x, y, z and t) of your frame of reference. 
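One way of testing whether a set of observations obeys equation 2 is to fit a straight line to them and look at what is left over. The sketch below uses an ordinary least-squares fit; the recorded values are hypothetical, and the function name is a choice made for this illustration.

```python
# Sketch: checking that position-time observations lie on a straight line.
# The "observations" are hypothetical values invented for illustration.


def fit_straight_line(times, positions):
    """Least-squares straight line x = gradient * t + intercept."""
    n = len(times)
    mean_t = sum(times) / n
    mean_x = sum(positions) / n
    s_tt = sum((t - mean_t) ** 2 for t in times)
    s_tx = sum((t - mean_t) * (x - mean_x)
               for t, x in zip(times, positions))
    gradient = s_tx / s_tt
    intercept = mean_x - gradient * mean_t
    return gradient, intercept


# Clock readings t (in seconds) and positions x (in metres), each reading
# taken from the synchronized clock located at the corresponding x.
times = [0.0, 1.0, 2.0, 3.0, 4.0]
positions = [0.5, 2.5, 4.5, 6.5, 8.5]

gradient, intercept = fit_straight_line(times, positions)
residuals = [x - (gradient * t + intercept)
             for t, x in zip(times, positions)]

print(gradient, intercept)             # dx/dt in your frame, and x at t = 0
print(max(abs(r) for r in residuals))  # how far the points stray from the line
```

The gradient is a statement about the x and t of your frame only; an observer using a different frame would have to repeat the fit with his own coordinates.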


Buoyed up by your success in making one discovery that you think is a law of physics,
you might well try to discover another law. Suppose that a positive electric charge
was fixed at the point x = X, y = 0, z = 0 in your frame (Figure 16)*. If you released
a positively charged particle from some point on the x-axis, you would see it move
along the x-axis, accelerating away from the fixed charge at x = X. By using your
frame of reference, you could measure the x-component of the acceleration of the
particle as its distance from the fixed charge increased. A graph of your results would
look something like Figure 17.


* Remember, x is a variable denoting the x-position coordinate of the particle at any time
and X is a particular number, the value of x corresponding to some fixed point.


Figure 14 Your room as an inertial frame.


Figure 15 The position-time graph for the force-free particle that moves along
the x-axis of your frame.


Figure 16 A positive charge is fixed at x = X, y = 0, z = 0 in your frame, and
you observe the acceleration of a positively charged particle released from
a point on the x-axis.


You could repeat this experiment several times using different charges and different 
particles and even different values of X, making a new graph each time. After several 
repetitions you would find that, although the exact size and shape of the graph 
depended on the particles and charges involved, the graph was always of the form 


x-component of acceleration = constant / (distance between particle and charge)²    (3)
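A simple test for the form of equation 3 is to multiply each measured acceleration by the square of the corresponding distance; if the relationship holds, the product is the same for every measurement. The data below are hypothetical numbers invented for illustration, not experimental results.

```python
# Sketch: testing a set of measurements for inverse-square behaviour.
# The distances and accelerations are hypothetical illustrative values.

distances = [0.1, 0.2, 0.4, 0.5]          # particle-to-charge distance, in m
accelerations = [50.0, 12.5, 3.125, 2.0]  # x-component of acceleration, m/s^2

# If acceleration = constant / distance^2, then acceleration * distance^2
# should give the same constant for every pair of readings.
products = [a * d**2 for a, d in zip(accelerations, distances)]
spread = max(products) - min(products)

print(products)
print(spread)  # negligible if the inverse-square form fits the data
```

With real measurements the products would scatter a little, and you would have to judge whether the spread was consistent with the uncertainties in your readings.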


ITQ 1 An observer who had no previous knowledge of physics would not 
have any reason to expect the result given in equation 3, but having studied 
Units 2, 3 and 6 you should be able to explain it. (a) Why is the acceleration 
proportional to the inverse square of the distance between the particle and the 
fixed charge? (b) If the fixed charge is Q, and the particle has mass m and charge 
q, what is the value of the constant in equation 3? 


Equation 3 can be rewritten in a more mathematical form that reveals the essential
role that your frame of reference plays in the process of discovering and writing down
physical laws. As you now know, the x-component of the acceleration can be written
as dv_x/dt. But v_x, the x-component of the velocity, can be written as dx/dt. So
mathematically the x-component of the acceleration can be denoted d/dt(dx/dt), which is
written as d²x/dt² for short. Furthermore, the distance between the particle and the
fixed charge is simply given by (x − X), where x is the position of the particle. If you
used these pieces of mathematical notation to rewrite equation 3, you would say
that the second law of physics that you have discovered was that:


For a charged particle moving away from a fixed charge 


d²x/dt² = constant / (x − X)²    (4)


The important point about your ‘discoveries’ is that you naturally express them in 
terms of the coordinates of your frame of reference. Neither equation 2 nor equation 4 
makes any reference to abstract quantities like forces or fields. Apart from constants, 
both of your ‘laws of physics’ are written in terms of x and t, the coordinates of your 
frame of reference. 
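
As a minimal sketch of how the x-component of the acceleration might be estimated from measurements of x and t alone, the Python below applies a central finite-difference formula to equally spaced position samples. The function name `second_derivative`, the sampling interval and the test trajectory are illustrative assumptions, not part of the experiment described in the text.

```python
def second_derivative(xs, dt):
    """Estimate d2x/dt2 at the interior sample times from equally spaced
    position samples xs (in metres) recorded every dt seconds."""
    return [(xs[i + 1] - 2.0 * xs[i] + xs[i - 1]) / dt**2
            for i in range(1, len(xs) - 1)]


# Hypothetical data: a particle with a constant acceleration of 3 m s^-2,
# starting from rest at x = 1 m and sampled every 0.5 s.
dt = 0.5
times = [i * dt for i in range(6)]
positions = [1.0 + 0.5 * 3.0 * t**2 for t in times]

accelerations = second_derivative(positions, dt)
print(accelerations)
```

Only the coordinates x and t of the observer's own frame enter the calculation, which is the point made above: the result is expressed entirely in terms of those coordinates.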


The message that you should take away from this Section is this: 


Any observer who sets out to write down the laws of physics will naturally
express those laws in terms of the coordinates of his own frame of reference.


He has to do so; they are the coordinates he has committed himself to using. You 
should now be able to see the fundamental importance of a frame of reference; it is 
the possession of a well-defined frame that makes it possible for an observer to make 
measurements and deduce physical laws. 


The principle of relativity 


After you had carried out a great many observations of the kind described in the 
previous Section, you might well feel that the results you had discovered (equations 2 
and 4) deserved to be called laws of physics. But would it not be rather presumptuous 
of you to call your discoveries laws? After all, the only evidence that you have has 
been collected in your own frame of reference, and it is certainly not the case that all 
statements which are true in your frame of reference are necessarily laws of physics. 
For example, it might well be true that the curtains in your room are red, but that does 
not make the statement ‘curtains are red’ into a law of physics. Before elevating any 
statement to the status of a law, it obviously makes sense to find some other observers,
in other frames of reference, who agree with your discovery. Of course, even the 
agreement of a few other observers would not guarantee that your discovery deserved 
to be a law (the other observers might coincidentally have red curtains in their 
rooms), but at least their agreement would mean that your discovery had some 
chance of being a law of physics. 




Figure 17 The graph of the x-component of the acceleration of the particle against
the distance between the particle and the fixed charge.


One of the most basic assumptions of special relativity is that any law of physics that 
is true in one inertial frame will also be true in all other inertial frames. This is called
the principle of relativity. It implies that if you make a discovery in the inertial frame 
of your room, you are only entitled to call it a law of physics if it can be shown to 
be true in every other inertial frame. This principle is so important that it is worth 
stating it in a number of different ways to make sure that you really understand it. 
One form of the principle of relativity is the following: 


The laws of physics do not allow you to distinguish between different inertial 
frames. 


A simple example will help you to understand this form of the principle of relativity. 
Suppose that you are in a spacecraft moving with constant velocity relative to a star 
that happens to be at rest in a given inertial frame. The principle of relativity says 
that provided you are completely cut off from the star (i.e. you cannot detect its 
gravitational field and you cannot see it or send signals to it) then: 


There is no experiment that you can perform enclosed in your spacecraft that 
will enable you to determine your velocity relative to the star. 


This is an inevitable consequence of the principle of relativity. If there were such an 
experiment, it would mean that the laws of physics in your inertial frame (the frame 
of the spacecraft) depended in some way on the velocity, and were different from the 
laws of physics in the inertial frame of the star. 


It is important to realize that the principle of relativity does not claim that all inertial
frames are the same in all respects. To appreciate this, imagine two different spacecraft,
each travelling with a different constant velocity relative to the star mentioned
above. Each can be thought of as being at rest in its own inertial frame, and the 
principle of relativity asserts that the two frames are indistinguishable so far as the 
laws of physics are concerned. However, if it were possible to look outside each 
spacecraft, through a window say, it would be easy to tell them apart. The apparent 
velocity of the star would depend on the spacecraft (inertial frame) from which it was 
being viewed. However, this simple method of distinguishing between two different 
inertial frames does not conflict with the principle of relativity, because the velocity 
of a star relative to any particular inertial frame is not determined by a law of physics;
observing the velocity of a star from a particular inertial frame is just like observing
the colour of the curtains in your room.


When we think about the laws of physics, we tend to think of the mathematical 
equations that express those laws. For this reason, the most useful way of stating the 
principle of relativity is in a form that relates to mathematical equations. 


The principle of relativity The laws of physics can be written in the same form 
in all inertial frames. 


To explain what this means, and in particular what the phrase ‘same form’ implies, 
let us return to the two discoveries that you made in the inertial frame of your room— 
the discoveries expressed in equations 2 and 4. 


Suppose that another observer, in a different inertial frame, is willing to help you 
check your discoveries. Because this new observer is using a different frame from 
yours, we cannot call the axes of his frame x, y and z or there will be confusion. 
Instead we will call the axes of his frame x’, y' and z’. Similarly, although all the clocks 
in his frame must be synchronized with each other, there is no reason why they 
should be synchronized with your clocks, so we will call the time coordinate of the 
new frame t’, to distinguish it from your coordinate. 


In order for the new observer to check your discoveries, you must tell him what they 
are, but that presents something of a problem. On the one hand, you have naturally 
expressed your discoveries in terms of the x- and t-coordinates of your frame; even the
constants in equations 2 and 4 have been determined in your frame. On the other 
hand, the new observer only understands equations that are expressed in terms of 
his coordinates x', y', z' and t'. What should the new observer be asked to check in
order to provide support for your discovery? The answer to this question is provided 
by the principle of relativity. According to the principle of relativity, if your discoveries
are indeed laws of physics then they can be written in the same form in all
inertial frames of reference. For example, your first discovery was that the
x-component of the velocity of a force-free particle is given by 


$$\frac{dx}{dt} = \text{constant} \qquad \text{(eq. 2)}$$


If this really is a law of physics, then according to the principle of relativity the new 
observer should find that in his frame the motion of a force-free particle is given by 

$$\frac{dx'}{dt'} = \text{constant}' \qquad (5)$$


Note that in the frame of the new observer the value of the constant may be different — 
that is why it has been called constant’—but it will still be constant*. 


The second of your discoveries, the one about the x-component of the acceleration 
of a charged particle as it moves away from a fixed charge, can be treated in the same 
way. Your discovery was that 


$$\frac{d^2x}{dt^2} = \frac{\text{constant}}{(x - X)^2} \qquad \text{(eq. 4)}$$


If this really is a law of physics, then the new observer should find that in his frame 


$$\frac{d^2x'}{dt'^2} = \frac{\text{constant}'}{(x' - X')^2} \qquad (6)$$


where X’ is the location of a fixed charge on the x’-axis of his frame. 


Of course, if the new observer fails to find any experimental support for equations 5 
and 6, you can conclude that your discoveries are not laws of physics, even though 
they may be accurate statements in your own frame. 


ITQ 2 If the new observer finds that equations 5 and 6 are supported by
experiment, does it follow that your discoveries are laws of physics? 


The principle of relativity is one of the foundation stones upon which the theory of 
relativity is built. It is a remarkably powerful and far-reaching principle because it 
applies to all the laws of physics, even the ones that have not yet been discovered! In 
view of its importance, it is only proper to ask what evidence there is in support of 
the principle of relativity. 


Einstein's own motives for introducing the principle of relativity are still to some 
extent a matter for debate amongst historians of science, but our reasons for accepting 
the principle today are firmly based on experimental evidence accumulated since the 
time of Einstein. In Section 4 you will see that once the form of an equation is known 
in one inertial frame, it is possible to use a mathematical technique to work out the 
form of that equation in any other inertial frame. According to the principle of 
relativity, any equations that cannot be written in the same form in all inertial frames 
cannot be laws of physics. So, without doing any experiments, this technique makes 
it possible to pick out equations that cannot be laws of physics. By using this method, 
Einstein discovered that many of the equations that were thought to be ‘laws of 
physics’ in his day could not be laws at all, even though they agreed with all existing 
experiments. Einstein proposed modifications or alternatives to many of these false 
laws—alternatives that could be written in the same form in all inertial frames, in 
accordance with the principle of relativity. Subsequent experiments have shown that 
the old ‘laws’, though adequate for their own time, do not describe the wide range 
of results that are now available. On the other hand, the modified laws that do take 
the same form in all inertial frames have been supported by experiment. It is this 
experimental support for the consequences of Einstein's theory that provides convincing
evidence that the basic assumptions of the theory, including the principle of
relativity, must be correct. 


SAQ 7 Suppose that while observing the motion of a particle in your 
inertial frame you come to the conclusion that the following equation describes 
the x-component of the acceleration of the particle: 

$$\frac{d^2x}{dt^2} = \frac{dx}{dt} + k\left[(x - X)^2 + (y - Y)^2 + (z - Z)^2\right]$$

where k is a constant and X, Y and Z are the coordinates of a second particle
in your frame. What relationship must an observer in a different inertial
frame (with coordinate axes x', y', z' and t') find to be experimentally valid if
it is even worth giving further consideration to your formula as a possible law
of physics? (Naturally, you should assume that the principle of relativity is
correct in answering this question.)


* In more-advanced treatments of relativity, the laws of physics can be formulated so that
constants have the same numerical value in all inertial frames. However, in our introductory
treatment constants may have different values in different inertial frames.


SAQ 8 Imagine that you are standing beside a motorway looking at a 
bridge that, naturally enough, is at rest relative to you. A friend is driving past 
in his car, which is moving with constant velocity relative to you. As he passes 
he shouts out of the window that the principle of relativity is false because he 
can distinguish his frame from yours simply by observing that the bridge is not 
at rest in his frame. Is his claim justified? Give reasons for your answer. 
(Assume that a frame fixed on the Earth is an inertial frame.) 


Summary of Section 3 


This is a good place to gather together the definitions and concepts that have been 
introduced in this Section. 


EVENT An event is a physical occurrence that occupies a point in space and an 
instant in time. 


FRAME OF REFERENCE A frame is some system that makes it possible to assign to 
any event four numbers which distinguish it from all other events. Three of the 
numbers fix the position of the event relative to a point called the origin, and these 
numbers are called the position coordinates of the event. The fourth number is called 
the time coordinate and fixes the time at which the event occurred. 


INERTIAL FRAME A frame of reference in which Newton’s first law is true. (Any 
frame that moves with constant velocity relative to an inertial frame will also be an 
inertial frame.) 


OBSERVER Someone who consistently uses a specific frame of reference to assign 
four coordinates to every event that he observes. Any observer who sets out to write 
down the laws of physics will naturally express those laws in terms of the coordinates 
of his own frame of reference. 


THE PRINCIPLE OF RELATIVITY The laws of physics can be written in the same form 
in all inertial frames. 


Coordinate transformations 


A simple example of a coordinate transformation 


In Section 3.5 we discussed the results that an observer using an inertial frame of 
reference must find, if a discovery made in your inertial frame is to stand any chance 
of actually being a physical law. (For example, unless he discovers that equation 6 is 
true in his frame, your equation 4 cannot be a law of physics.) However, it is not really 
necessary for a given equation to be confirmed experimentally in all other inertial 
frames before it can be admitted to be a possible law of physics. There is, thank 
goodness, a much simpler way. By using your own experimental results and something
called a coordinate transformation, it is possible to work out mathematically the
results that other observers would find. A simple example will show you how this 
can be done, but first the term coordinate transformation must be explained. 


Suppose that in a room adjoining yours there is an observer who has agreed to help 
you test your discoveries. This new observer is using one of the corners of his own 
room as the origin of coordinates in his frame, and he is measuring time by means of 
clocks that are synchronized with a clock on the wall of his room. 


The axes of the new observer's coordinate system will be called x', y' and z', and they
are shown in Figure 18. As you can see, corresponding axes in the two frames are
parallel, and the origin of the new observer's frame is at the point x = 6 m, y = 0,
z = 0 in your frame. This means that the point which you specify as x = 8 m,
y = 2 m, z = 3 m, will be assigned the coordinates x' = 2 m, y' = 2 m, z' = 3 m, by
the new observer.




ITQ 3 (a) What values of x', y' and z' will the new observer assign to the 
following points: 

(1) —x.— 3m yi 2m: ze ome 

(i) the origin of your frame; 


(iii) the origin of his own frame? 
(b) What values of x, y and z would you assign to the following points: 


0- x c= 3m yes 7:myz-—9; 


a) == 2>my 312 — —2m? 


Of course, the new observer will also want to assign a time to the events that happen 
in his frame of reference. He can do this in the standard way by using synchronized 
clocks, but, although all of his clocks must be synchronized with one another, there 
is no reason why they should be synchronized with your clocks. (There may well be a 
clock in one of the rooms of your home that is always a bit out of step with your other
clocks.) In recognition of this possible difference we will call the time coordinate used
by the new observer t', to distinguish it from the t-coordinate used in your frame. It is
important to note that we always assume that two clocks which are at rest relative to 
one another run at the same rate. This means that the clocks used by the new observer 
are neither gaining nor losing relative to your clocks; they may only differ by a 
fixed amount. To illustrate this point, imagine that an event occurs at some point in 
space, and suppose that both you and the new observer are on the scene with your 
respective clocks, ready to assign a time to the event. Suppose that the time of the 
event according to your clock is t = 4.2 s, while the clock of the new observer shows
t' = 5.4 s, so the difference between t' and t is 1.2 s. If another event occurs at some
other position and you find that the time coordinate of that event is t = 8.0 s then
the new observer is bound to find that the event occurs at time t' = 9.2 s in his frame
of reference. 


ITQ 4 (a) What value of t' would the new observer assign to an event that 
took place at t = 3.0 s in your frame?

(b) What value of t would you assign to an event that occurred at t' = 0.6 s 
according to the new observer? 


Having answered ITQs 3 and 4 you should now be able to take the coordinates of a 
given event in your frame, and work out the coordinates of that same event in the 
frame of the new observer. In doing so you are really using the following set of 
equations: 


$$\begin{aligned}
x' &= x - 6\,\mathrm{m} \\
y' &= y \\
z' &= z \\
t' &= t + 1.2\,\mathrm{s}
\end{aligned} \qquad (7)$$


Such a relationship between the coordinates of two different frames of reference is
called a coordinate transformation. It enables you to work out the four coordinates
of any event in the frame of the new observer, provided you know the coordinates of
that same event in your own frame.


Figure 18 Two inertial frames of reference. In each case the origin of
coordinates is in the corner of a room. The red x'-axis should really be
exactly superimposed on the black x-axis. All distances are marked in metres.


In fact, in answering ITQs 3 and 4 you not only made use of the coordinate transformation
given in equation 7, but you also used the inverse coordinate transformation:


$$\begin{aligned}
x &= x' + 6\,\mathrm{m} \\
y &= y' \\
z &= z' \\
t &= t' - 1.2\,\mathrm{s}
\end{aligned} \qquad (8)$$


This is the transformation that enables you to work out the coordinates of an event 
in your frame if you know the coordinates of that event in the frame of the new 
observer. 
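
The coordinate transformation of equation 7 and its inverse (equation 8) can be sketched in a few lines of Python. The function and constant names below are illustrative assumptions. The sample event combines the point x = 8 m, y = 2 m, z = 3 m and the time t = 8.0 s, both taken from the examples above; treating them as a single event is also an assumption made purely for illustration.

```python
X_OFFSET_M = 6.0   # the new observer's origin lies at x = 6 m in your frame
T_OFFSET_S = 1.2   # the new observer's clocks read 1.2 s ahead of yours


def to_new_frame(x, y, z, t):
    """Equation 7: your coordinates -> the new observer's coordinates."""
    return (x - X_OFFSET_M, y, z, t + T_OFFSET_S)


def to_your_frame(xp, yp, zp, tp):
    """Equation 8: the new observer's coordinates -> your coordinates."""
    return (xp + X_OFFSET_M, yp, zp, tp - T_OFFSET_S)


event = (8.0, 2.0, 3.0, 8.0)        # (x, y, z, t) in metres and seconds
primed = to_new_frame(*event)       # (x', y', z', t')
recovered = to_your_frame(*primed)  # applying the inverse returns the original

print(primed)
print(recovered)
```

Applying the transformation and then its inverse returns the coordinates you started with (up to floating-point rounding), which is what it means for equation 8 to be the inverse of equation 7.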


Using the coordinate transformation of equations 7 and 8, we can take experimental 
observations made in your frame and work out how the same experiments would 
have appeared to the new observer. In other words, we can transform a result expressed
in terms of the coordinates of your frame into an expression involving only
the coordinates used by the new observer. This transformed expression is the result
that the new observer would have obtained if he had actually performed the experiment
himself. An example will clarify this point.


From Section 3.4 we know that if there is a charge fixed at the point x = X, y = 0,
z = 0, in your frame, and if you make observations of the x-component of the
acceleration of a charged particle as it moves away from the fixed charge, you will
find that


$$\frac{d^2x}{dt^2} = \frac{\text{constant}}{(x - X)^2} \qquad \text{(eq. 4)}$$


This is a result that describes the experiment as observed in your frame. What would 
the new observer have deduced if he had observed the same experiment from his 
frame? Let's work out the answer step by step by looking at each of the terms in 
equation 4. 


1 In your frame the fixed charge is at x = X, y = 0, z = 0. It follows from the
coordinate transformation that in the new observer's frame the fixed charge is at
x' = X − 6 m, y' = 0, z' = 0. If we call the value of x' at which the charge is fixed
X', then we can write X = X' + 6 m.


2 Similarly, it follows from the coordinate transformation that when the particle
is at point x in your frame, its position in the frame of the new observer is given by
x' = x − 6 m. In other words, x = x' + 6 m.


3 In ITQ 1 you showed that the constant mentioned in equation 4 is given by
Qq/(4πε₀m), where Q is the fixed charge, q is the charge of the particle, and m is its
mass. None of these quantities is changed when the experiment is viewed from 
another room, so the coordinate transformation has no effect on the size of the 
constant. 


4 Physically, d²x/dt² is just the x-component of the acceleration of the particle.
Observing the acceleration from the adjoining room will not alter it, so we expect
the x'-component of the acceleration of the particle, d²x'/dt'², as measured by the
new observer, to be the same as d²x/dt². In other words we expect d²x'/dt'² = d²x/dt².
This result can be proved mathematically by using the coordinate transformation
and some standard results of differential calculus.
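
The calculus argument mentioned in point 4 can be outlined as follows for the particular transformation x' = x − 6 m, t' = t + 1.2 s. This is a sketch of the reasoning rather than a rigorous proof; it relies only on the fact that the two offsets are constants.

```latex
\begin{align*}
t' = t + 1.2\,\mathrm{s}
  \;&\Rightarrow\; \frac{dt'}{dt} = 1,
  \quad\text{so}\quad \frac{d}{dt'} = \frac{d}{dt} \\[4pt]
x' = x - 6\,\mathrm{m}
  \;&\Rightarrow\; \frac{dx'}{dt'} = \frac{d}{dt}\left(x - 6\,\mathrm{m}\right) = \frac{dx}{dt} \\[4pt]
  \;&\Rightarrow\; \frac{d^2x'}{dt'^2} = \frac{d}{dt}\left(\frac{dx}{dt}\right) = \frac{d^2x}{dt^2}
\end{align*}
```

The constant offsets disappear on differentiation, so both the velocity and the acceleration have the same value in the two rooms.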


Using these four consequences of the coordinate transformation, we can eliminate 
all references to the coordinates of your frame from equation 4. Making the necessary 
substitutions gives: 

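$$\frac{d^2x'}{dt'^2} = \frac{\text{constant}}{\left((x' + 6\,\mathrm{m}) - (X' + 6\,\mathrm{m})\right)^2}$$

i.e.

$$\frac{d^2x'}{dt'^2} = \frac{\text{constant}}{(x' - X')^2} \qquad (9)$$
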
This is the result that the observer in the adjoining room would have found if he had 
carried out the experiment with the charged particle. 
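
A quick numerical check of this substitution is sketched below. The value of the constant, the position of the fixed charge and the sample particle positions are all hypothetical numbers chosen for illustration; only the 6 m offset comes from the text.

```python
OFFSET_M = 6.0    # the new observer's origin lies at x = 6 m in your frame


def acceleration_your_frame(x, X, constant):
    """Equation 4: x-component of the acceleration, using x and X."""
    return constant / (x - X) ** 2


def acceleration_new_frame(xp, Xp, constant):
    """Equation 9: the same acceleration, using x' and X'."""
    return constant / (xp - Xp) ** 2


constant = 5.0    # hypothetical value, in m^3 s^-2
X = 7.0           # hypothetical position of the fixed charge, in metres

checks = []
for x in (8.0, 9.5, 12.0):   # hypothetical particle positions, in metres
    a = acceleration_your_frame(x, X, constant)
    a_prime = acceleration_new_frame(x - OFFSET_M, X - OFFSET_M, constant)
    checks.append((a, a_prime))

print(checks)
```

The two values in each pair agree because the 6 m offset cancels in the difference x' − X', just as it does in the algebra above.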


Now think back to Section 3.5. One of the conclusions that we arrived at when we
applied the principle of relativity to your discovery of equation 4 was that if equation
4 stood any chance of being a law of physics then any observer using an inertial
frame would have to find that a charged particle accelerating away from another
charge obeyed the equation


$$\frac{d^2x'}{dt'^2} = \frac{\text{constant}'}{(x' - X')^2} \qquad \text{(eq. 6)}$$

Equation 9 shows that this is indeed what the observer in the adjoining room would 
find if he performed an experiment. So, although we have certainly not shown that 
your discovery (equation 4) can be written in the same form in all inertial frames, 
we have at least established that it retains its form in one other inertial frame—that 
of the next room. The important point to note is that we have been able to establish 
this simply by using your equation 4 and the coordinate transformation between 
two frames. 


SAQ 9 Suppose that the observer in the adjoining room had chosen to label
his coordinate axes differently so that they were arranged as shown in Figure
19. The coordinate transformation between your frame and his would then be:

$$\begin{aligned}
x' &= -y \\
y' &= x - 6\,\mathrm{m} \\
z' &= z \\
t' &= t + 1.2\,\mathrm{s}
\end{aligned}$$


Using this coordinate transformation, work out the result that the observer 
in the adjoining room would deduce if he observed the experiment that led 
you to equation 4. (In other words, transform equation 4 into the frame of the 
adjoining room using this coordinate transformation.) 




The central problem of relativity 


In the previous Section we were able to take a result (equation 4) deduced from 
experiment in your inertial frame, and use a coordinate transformation to show that 
an observer in another inertial frame would find that a result of the same form 
(equation 9) was true in his frame. Thus we established that as far as you and the 
other observer in the adjoining room were concerned, your result—equation 
4—might be a law of physics because it did not conflict with the principle of relativity. 
However, this conclusion was not really very surprising because we were only 
dealing with two frames that were at rest relative to one another. According to the 
principle of relativity, a law of physics can be written in the same form in all inertial 
frames. So, obviously, a much more stringent test of any equation that you propose 
as a possible law of physics is to transform it into an inertial frame that is moving
relative to your frame, and check that it can also be written in the same form in that
frame. If it can, then it may be a physical law, and experiment can decide whether it
is or not. If it cannot be written in the same form in the moving frame then it cannot
be a law of physics, and you don't need to waste any more time testing it.


Figure 19 The two inertial frames discussed in SAQ 9. The y'-axis should
really be exactly superimposed on the x-axis; they are separated for clarity.


Suppose we have an equation expressed in terms of the coordinates of your frame 
(x, y, z and t), such as equation 4. How can we work out the form that the equation 
takes when transformed to an inertial frame that is moving relative to your frame? 
Well, obviously we need to know the coordinate transformation that links your 
frame and the moving frame. Working out that transformation is clearly a very 
important problem and it is also quite difficult, so it makes sense to solve the problem 
in its simplest form. We must think of the simplest possible arrangement of two
inertial frames that are in relative motion, and work out the coordinate transformation
between them.


The simplest possible arrangement of two inertial frames that move with constant 
velocity relative to each other is given the special name standard configuration. A 
precise definition of standard configuration will be given shortly, and that definition 
also appears on the fold-out page at the end of the Text so that you can consult it 
whenever you wish. 


However, in order to fix the concept of standard configuration in your mind, think 
of the two frames shown in Figure 20. Frame A is fixed on the platform of a railway 
station, with the x-axis pointing along the track. The observer in frame A measures 
time by means of clocks that are at rest in his frame, and which have been synchronized 
with the main station clock. (As usual, you should imagine that the observer in 
frame A has a synchronized clock wherever he needs one.) Frame B is fixed in a 
railway carriage that is moving relative to the station with constant speed V in the 
positive x-direction. The observer in frame B—a passenger on the train—uses clocks 
that have been synchronized with a clock fixed in his carriage in order to measure 
the time at which events occur in his frame. The frames A and B are arranged so the 
corresponding axes are parallel, and the origin of B moves along the x-axis of A. 




There is one additional condition needed to ensure that the two frames are in 
standard configuration. Because the origin of B is moving along one of the axes of A, 
there must be an instant at which the origins of the two frames coincide. This is an 
event in the technical sense of Section 3.2; it happens at a point, and it only lasts for 
an instant. The extra condition is that the clocks used by both observers were started 
in such a way that they both agree that the time of the coincidence event was zero 
according to their respective clocks. The example that has just been described is quite 
complicated, and you may need to read it again, but before you do, read through the 
more precise definition of standard configuration that follows. You may find it 
clearer than the example! 


Figure 20 Two inertial frames in standard configuration. The two origins
coincided at t = t' = 0. (The x- and x'-axes should really be superimposed.)


The two inertial frames, A and B, shown in Figure 21 are in standard configuration
when they satisfy all of the following conditions:


STANDARD CONFIGURATION


1 The origin of frame B moves with constant speed V, in the positive direction 
along the x-axis of frame A. 


2 The x-axis of A is parallel to the x'-axis of B. The y-axis of A is parallel to the
y'-axis of B. The z-axis of A is parallel to the z'-axis of B.


3 There must have been a moment when the origin of B exactly coincided with the 
origin of A. According to frame A, that event occurred at time t = 0. According to 
B it happened at t' = 0. 


(A short version of this definition appears on the fold-out page at the end of this Text.) 


You should pay special attention to point 3. In the simpler case of two neighbouring 
rooms (two inertial frames at rest relative to one another) we saw that the synchronized
clocks of one frame could only differ by a fixed amount from the clocks of
the other frame. In that situation, if t and t' agreed at one instant they were bound to 
agree forever afterwards. That is not necessarily the case when dealing with two frames 
in standard configuration. Even though an observer at the origin of A may adjust 
his watch so that it agrees with the watch of an observer at the origin of B at the 
moment that they occupy the same position, it does not follow that the watches of 
the two observers will be synchronized at any later time. At this stage we have not 
shown that the relative motion of two clocks or watches actually does have any 
effect upon their synchronization. All that has been established is that the definition 
of standard configuration is so carefully worded that it does not rule out such an 
effect. Moving clocks might behave differently—it is a possibility that must be 
investigated. 


Now that the meaning of standard configuration has been explained, it is possible 
to pose the central problem of relativity. This problem must be solved if we want to 
use the full power of the principle of relativity to assess the viability of equations that 
are proposed as possible ‘laws’ of physics. The central problem of relativity is: 


What is the coordinate transformation that links two inertial frames, A and B,
in standard configuration?


Einstein was the first to realize that the solution to this apparently unexciting question 
held the key to a new view of time and space. This new view radically altered our 
concepts of mass and energy and brought about a revolution in physics. 


Figure 21 Standard configuration of two inertial frames A and B. The origins of A
and B have been called O and O' respectively for simplicity, and coincide at
t = t' = 0.


SAQ 10 Suppose that two inertial frames, A and B, are arranged in such a
way that their corresponding axes are parallel (x parallel to x', etc.), and the
origin of frame B moves with a constant speed of 10 m s⁻¹ in the positive
direction along the x-axis of frame A (Figure 22). Suppose that the time t of
any event that takes place in frame A is determined by a clock that is synchronized
with a master clock at the origin of frame A. Similarly, suppose that
the time t' of any event that takes place in frame B is determined by a clock
that is synchronized with a master clock at the point x' = 10 m in frame B.
According to an observer in frame A, the master clock of frame B is moving
along the x-axis of frame A at 10 m s⁻¹; so, obviously, there will be one
instant at which the two master clocks coincide. Suppose that when this
happens each of the clocks is started at t = t' = −1 second. If this procedure
is followed, does the definition of standard configuration imply that these two
frames will necessarily be in standard configuration?


The Galilean coordinate transformation 


The coordinate transformation that Einstein proposed as the solution of the central 
problem of relativity will be discussed in some detail in Section 5. In this Section we 
are going to examine a different coordinate transformation: the so-called Galilean 
coordinate transformation. Before 1900, most physicists would have regarded the 
Galilean transformation as the ‘common-sense’ solution to the central problem of 
relativity—if they had thought of asking themselves that question. 


The Galilean transformation that relates the coordinates of an event in inertial
frame A (x, y, z and t) to the location of that same event in inertial frame B (x', y', z'
and t'), when A and B are in standard configuration (see fold-out), is:


$$\begin{aligned}
x' &= x - Vt \\
y' &= y \\
z' &= z \\
t' &= t
\end{aligned} \qquad (10)$$


Throughout this Section, make sure you remember that although these equations 
are a perfectly plausible solution to the central problem of relativity, they are not the 
correct solution. 
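
Equation 10 can be written as a short Python function, sketched below. The function names, the speed V = 20 m s⁻¹ and the sample event are hypothetical choices made for illustration, and the inverse function is an assumption obtained by solving equation 10 for x, y, z and t. As the text stresses, this transformation is plausible but is not the correct solution to the central problem of relativity.

```python
def galilean(x, y, z, t, V):
    """Equation 10: coordinates in frame A -> coordinates in frame B,
    for frames in standard configuration with relative speed V."""
    return (x - V * t, y, z, t)


def galilean_inverse(xp, yp, zp, tp, V):
    """Equation 10 solved for the frame A coordinates (an assumption
    made for illustration; it is not written out in the text)."""
    return (xp + V * tp, yp, zp, tp)


V = 20.0                              # hypothetical speed of frame B, in m/s
event_A = (100.0, 0.0, 0.0, 3.0)      # hypothetical event (x, y, z, t) in frame A
event_B = galilean(*event_A, V)       # the same event as described in frame B
round_trip = galilean_inverse(*event_B, V)

print(event_B)
print(round_trip)
```

Note that the time coordinate passes through unchanged: in this transformation the two frames always assign the same time to an event.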


In order to appreciate why the Galilean transformation would have been regarded 
as the common-sense solution to the central problem of relativity we will examine 
some of its consequences. 


(i) Consequence 1—the absolute nature of time 


Use equation 10 to answer each part of the following ITQ: 


ITQ 5 Suppose that A and B are the two inertial frames in standard
configuration that are shown on the fold-out page. If O is the origin of frame A,
and O' is the origin of frame B, and if V = 10⁸ m s⁻¹, answer the following
questions:


(a) At what time t, as measured in frame A, do the origins O and O' coincide?
(b) At what time t', as measured in frame B, do the origins O and O' coincide?
(c) When O and O' coincide, what is the x'-coordinate of the point in frame
B that coincides with the point x = 10⁸ m, y = 0, z = 0 in frame A?
(d) At what time t, as measured in frame A, does the point x = 10⁹ m,
y = 0, z = 0 in frame A coincide with the point x' = 10⁸ m, y' = 0,
z' = 0 in frame B?
(e) At what time t', as measured in frame B, does the event described in
part (d) above take place?


Figure 22 If the clocks are both started
at t = t' = −1 s (when the two clocks
are at the same location), does it
necessarily follow that the two frames
must be in standard configuration?


We noted in Section 4.2 that when two frames are in standard configuration the
coincidence of their origins is an event that occurs at time t = t' = 0. But we also
noted that the definition of standard configuration did not require that the time
coordinate of any other event would necessarily have the same value in both frames
(i.e. we did not require t = t' except for the coincidence of the origins at t = t' = 0).
However, as you can see from equation 10, in the Galilean transformation t and t'
are the same for any event, wherever and whenever it occurs. The Galilean
transformation does not exploit the possibility of a distinction between t and t'. This
means that the amount of time that separates two events according to an observer in
frame A is identical to the amount of time that separates the same two events
according to an observer in frame B. For example, consider two of the events discussed in
ITQ 5; the first event is the coincidence of O and O' and it occurs at t = t' = 0; the
second event is the coincidence of x' = 10⁸ m and x = 10⁹ m, which takes place at
t = t' = 9 seconds. The interval that elapses between the two events in frame A is
9 seconds, and an identical interval separates the two events in frame B. In other
words, under a Galilean transformation, the interval of time that separates two events
has the same duration in all inertial frames.


The identity of t and t' in the Galilean transformation really amounts to the assertion
that intervals of time have an absolute meaning, independent of the frame of reference. 
This is quite unlike speed, for example, which only has a meaning when expressed 
relative to a frame of reference. 
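

The absolute nature of time under equation 10 can be checked numerically. The
following Python sketch is only an illustration: the function name
`galilean_transform` and the way the events are stored are assumptions made here,
while the numbers are those of the two ITQ 5 events discussed above.

```python
def galilean_transform(x, y, z, t, V):
    """Equation 10: coordinates in frame B of an event at (x, y, z, t) in frame A."""
    return (x - V * t, y, z, t)


V = 1.0e8  # relative speed of frames A and B in m/s, as in ITQ 5

# Two events in frame A, written as (x, y, z, t) in metres and seconds.
origins_coincide = (0.0, 0.0, 0.0, 0.0)
points_coincide = (1.0e9, 0.0, 0.0, 9.0)

origins_in_b = galilean_transform(*origins_coincide, V)
points_in_b = galilean_transform(*points_coincide, V)

# Time interval between the two events, as measured in each frame.
interval_a = points_coincide[3] - origins_coincide[3]
interval_b = points_in_b[3] - origins_in_b[3]

print("x' of second event:", points_in_b[0])
print("interval in A:", interval_a, "interval in B:", interval_b)
```

Whatever value is chosen for V, the two printed intervals agree, because equation 10
copies t into t' unchanged.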


(ii) Consequence 2—the Galilean velocity transformation 


Another of the distinguishing features of the Galilean transformation is the rule that 
it provides for transforming velocities from one inertial frame to another. Such a rule 
is (not surprisingly) called a velocity transformation. The next ITQ and the discussion 
that follows will help to clarify the form and function of a velocity transformation. 


ITQ 6 Imagine that you are standing beside a motorway, observing two 
cars. One of the cars is travelling away from you at 70 m.p.h. while the other 
car travels in the same direction at 50 m.p.h. From your everyday experience, 
work out the speed of the faster car as measured by the driver of the slower car. 


It is possible to rephrase this kind of question in such a way that the role of a velocity
transformation becomes clear. Suppose that you are at rest in an inertial frame A,
and that the two cars mentioned in ITQ 6 are both travelling along the x-axis of that
frame. The x-component of the velocity of the faster car is v_x = 70 m.p.h., while the
slower car has speed V = 50 m.p.h. along the x-axis of frame A. The slower car can
be regarded as being at rest in an inertial frame B that is in standard configuration
with frame A (Figure 23). By using the frames A and B, the kind of question posed in
ITQ 6 can be rewritten in the following way:


Given the instantaneous velocity components (v_x, v_y, v_z) of an object (the
faster car) in an inertial frame A, what are the corresponding instantaneous
velocity components (v'_x, v'_y, v'_z) of the same object in an inertial frame B,
where A and B are in standard configuration with relative speed V?


Figure 23 ITQ 6 can be rephrased in
terms of two frames of reference in
standard configuration. All the speeds
shown are measured relative to frame A.


When the question is asked in this way, it is clear that what is needed is a set of
equations that express v'_x, v'_y and v'_z in terms of v_x, v_y, v_z and V. Such a set of equations
is called a velocity transformation. You should be able to see that your answer to
ITQ 6 (an answer based on your everyday experience) may be obtained by
substituting v_x = 70 m.p.h., v_y = 0, v_z = 0, and V = 50 m.p.h. into the following
velocity transformation:

\[
\begin{aligned}
v'_x &= v_x - V \\
v'_y &= v_y \\
v'_z &= v_z
\end{aligned}
\tag{11}
\]


Equation 11 is called the Galilean velocity transformation because it is a direct 
consequence of the Galilean coordinate transformation of equation 10. Once the 
coordinate transformation between two frames is known, it is always possible to 
deduce the corresponding velocity transformation between the two frames. To show 
how this is done we will derive equation 11 from equation 10; but first attempt the 
following SAQ to make sure that you really understand what a velocity transforma- 
tion is. 


SAQ 11 Suppose that A and B are two inertial frames in standard
configuration with relative speed V. A missile is travelling in the x-z plane of
frame A with speed √2 V, at 45° to the x-axis, as shown in Figure 24. Using
the Galilean velocity transformation, answer the following questions:


(a) What is the x'-component of the velocity of the missile in frame B?
(b) What is the z'-component of the velocity of the missile in frame B?
(c) What is the angle between the path of the missile and the x'-axis in
frame B?


Figure 24 A missile moves in the x-z
plane, at 45° to the x-axis and with speed
√2 V.


The Galilean velocity transformation is derived from the Galilean coordinate
transformation in the following way:


Consider a particle being observed in an inertial frame A and suppose that it travels
from an event E_1 with coordinates (x_1, y_1, z_1, t_1) to another event E_2 with
coordinates (x_2, y_2, z_2, t_2). (You can think of the particle as a cannon shell; E_1 is the
explosion that launches the shell and E_2 marks the shell's detonation upon impact.)
Figure 25 illustrates the motion. Now suppose that E_1 and E_2 are very close together.
So close in fact that the velocity with which the particle travels from E_1 to E_2 is its
instantaneous velocity according to an observer in A. (We are really dealing with the
limit in which the separation between E_1 and E_2 vanishes.) The three components
of the instantaneous velocity, as measured by an observer in A, are simply given by
the distance moved in the appropriate direction divided by the time taken, everything
being measured in A, naturally. Thus

\[
v_x = \frac{x_2 - x_1}{t_2 - t_1}, \qquad
v_y = \frac{y_2 - y_1}{t_2 - t_1}, \qquad
v_z = \frac{z_2 - z_1}{t_2 - t_1}
\tag{12}
\]

Now let us repeat the process of finding the components of the instantaneous
velocity, but this time from the point of view of an observer in frame B, a frame in
standard configuration with A. The coordinates of event E_1 in frame B are (x'_1, y'_1,
z'_1, t'_1) as shown in Figure 26. These coordinates are related to the coordinates of E_1
in frame A by the Galilean transformation of equation 10:

\[
\begin{aligned}
x'_1 &= x_1 - Vt_1 \\
y'_1 &= y_1 \\
z'_1 &= z_1 \\
t'_1 &= t_1
\end{aligned}
\tag{13}
\]


Figure 25 The initial and final events on
the trajectory of a particle that moves
through an inertial frame A.


Figure 26 The same two events that are
shown in Figure 25, but now as seen from
an inertial frame B that is in standard
configuration with frame A.


Similarly the coordinates of E_2 in frame B are:

\[
\begin{aligned}
x'_2 &= x_2 - Vt_2 \\
y'_2 &= y_2 \\
z'_2 &= z_2 \\
t'_2 &= t_2
\end{aligned}
\tag{14}
\]

The components of the instantaneous velocity in frame B are

\[
v'_x = \frac{x'_2 - x'_1}{t'_2 - t'_1}, \qquad
v'_y = \frac{y'_2 - y'_1}{t'_2 - t'_1}, \qquad
v'_z = \frac{z'_2 - z'_1}{t'_2 - t'_1}
\tag{15}
\]

Using equations 13 and 14 to eliminate all of the primed coordinates from equation
15, it follows that

\[
v'_x = \frac{(x_2 - Vt_2) - (x_1 - Vt_1)}{t_2 - t_1}, \qquad
v'_y = \frac{y_2 - y_1}{t_2 - t_1}, \qquad
v'_z = \frac{z_2 - z_1}{t_2 - t_1}
\]

i.e.

\[
v'_x = \frac{x_2 - x_1}{t_2 - t_1} - \frac{V(t_2 - t_1)}{t_2 - t_1}
\]

so

\[
v'_x = \frac{x_2 - x_1}{t_2 - t_1} - V
\]

Now using equation 12 it can be seen that

\[
\begin{aligned}
v'_x &= v_x - V \\
v'_y &= v_y \\
v'_z &= v_z
\end{aligned}
\tag{eq. 11}
\]


As you can see, the claim made earlier has been justified. Equation 11—the Galilean 
velocity transformation—has been deduced as a direct consequence of equation 
10—the Galilean coordinate transformation. This is an important point, because it 
shows that if the Galilean coordinate transformation is the correct solution to the 
central problem of relativity (Section 4.2), then the Galilean velocity transformation 
must be supported by experiment. Conversely, if it is found experimentally that the 
Galilean velocity transformation is not supported by experiment, we must conclude 
that the Galilean coordinate transformation is not the correct solution to the central 
problem of relativity. So we can test the coordinate transformation by testing the 
velocity transformation. 
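

The derivation above can also be followed numerically. The sketch below is an
illustration only: the helper names `galilean_transform` and `velocity_between`,
and the choice of a time step of half an hour, are assumptions made here. It
takes two nearby events on the path of the faster car of ITQ 6, transforms each
with equation 10, and then forms the velocity in frame B from the transformed
events, as in equation 15.

```python
def galilean_transform(x, y, z, t, V):
    """Equation 10: coordinates in frame B of an event at (x, y, z, t) in frame A."""
    return (x - V * t, y, z, t)


def velocity_between(event_1, event_2):
    """Velocity components from two events (x, y, z, t): distance moved / time taken."""
    dt = event_2[3] - event_1[3]
    return tuple((event_2[i] - event_1[i]) / dt for i in range(3))


V = 50.0                   # speed of frame B relative to frame A, in m.p.h.
v_in_a = (70.0, 0.0, 0.0)  # velocity of the faster car in frame A, in m.p.h.

# Two events on the car's path in frame A, a time dt (in hours) apart.
dt = 0.5
e1_a = (0.0, 0.0, 0.0, 0.0)
e2_a = (v_in_a[0] * dt, v_in_a[1] * dt, v_in_a[2] * dt, dt)

# Transform both events into frame B, then form the velocity there.
e1_b = galilean_transform(*e1_a, V)
e2_b = galilean_transform(*e2_a, V)
v_in_b = velocity_between(e1_b, e2_b)

print("velocity in frame B (m.p.h.):", v_in_b)
```

The x-component printed is v_x − V, and the other two components are unchanged,
which is the content of equation 11.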


We know from ITQ 6, and the discussion that followed it, that the Galilean velocity 
transformation agrees with our everyday ideas about relative velocity. However, 
those ideas come from our experience of fairly low speeds; typically, no more
than 10² m s⁻¹. Such a limited range of everyday speeds cannot provide sufficient
evidence to ‘prove’ the validity of the Galilean velocity transformation,
because physicists frequently have to deal with much greater speeds. In fact, we 
already know of one experimental result that conflicts with the predictions of the 
Galilean velocity transformation—a result that involves the speed of light. 


In Section 2.4 we saw that the speed of light relative to a detector is always c; in
other words, the speed of light is independent of the frame in which it is measured.
This result contradicts the Galilean rule for transforming any instantaneous velocity.
To see the contradiction, consider two inertial frames, A and B, in standard
configuration, with the origin of B moving along the x-axis of A at speed V = c/3.
Suppose that a ray of light is travelling along the x-axis of frame A. Then according
to an observer in frame A, the x-component of the velocity of the light ray will be
v_x = c. It follows from the Galilean velocity transformation that, according to an
observer in frame B, the speed of the light ray will be v'_x = v_x − c/3 = 2c/3. Since
this is a false prediction, we are forced to conclude that the Galilean coordinate
transformation, despite its intuitive appeal, is wrong. It does not correctly describe
the relationship between the coordinates of two inertial frames in standard
configuration. We must look elsewhere for a solution to the central problem of relativity.
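

The conflict can be made explicit with a few lines of Python. This is a sketch
under stated assumptions: the function name `galilean_velocity` is introduced
here for illustration, and speeds are expressed in units of c so that exact
fractions can be used.

```python
from fractions import Fraction


def galilean_velocity(v_x, v_y, v_z, V):
    """Equation 11: velocity components in frame B from those in frame A."""
    return (v_x - V, v_y, v_z)


# Work in units of c, so that the speed of light in frame A is exactly 1.
c = Fraction(1)
V = c / 3  # speed of frame B relative to frame A

v_light_in_b = galilean_velocity(c, 0, 0, V)

print("Galilean prediction in frame B, in units of c:", v_light_in_b[0])
print("Agrees with the measured value c:", v_light_in_b[0] == c)
```

The Galilean rule can only return c in frame B if V = 0, that is, if the two
frames are not in relative motion at all.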


4.4 Summary of Section 4


Given the coordinates of an event in one frame of reference, a coordinate trans- 
formation is a set of equations that makes it possible to work out the coordinates of 
that same event as viewed from another frame of reference. 


By using a coordinate transformation it is possible to determine the results of 
experiments in one frame simply by transforming results obtained in another frame. 


By using a coordinate transformation it is possible to determine the form taken by a 
given equation when transformed into another frame of reference. 


If the coordinate transformation between any two inertial frames were known, it 
would be possible to check whether an equation expressed in terms of the co- 
ordinates of one inertial frame could be written in the same form in all other inertial 
frames. If it could, then that equation would be a possible law of physics. 


The central problem of relativity is to determine the coordinate transformation that 
links two inertial frames, A and B, in standard configuration (see fold-out page). 


The Galilean coordinate transformation is a plausible solution to the central problem 
of relativity. However, it leads to a prediction about the transformation of velocities 
that is in conflict with experiment. Thus, although the Galilean transformation is 
intuitively appealing and agrees with everyday experience, it is not the correct 
solution to the central problem of relativity. 


5 Special relativity


5.1 Einstein’s two postulates


‘The relativity theory arose from necessity, from serious and deep contradictions in 
the old theory from which there seemed no escape. The strength of the new theory 
lies in the consistency and simplicity with which it solves all the difficulties using only 
a few very convincing assumptions.’ (Einstein and Infeld*) 


By the end of the nineteenth century it was well known that there was a serious 
problem with the electromagnetic theory of light. Maxwell’s theory predicted the 
correct numerical value of c, but it failed to explain why c was always the speed of 
light relative to its detector. A number of physicists tried to resolve this problem, 
mainly by modifying Maxwell’s theory. However, it was the 26-year-old Albert 
Einstein, who, while working as a clerk in the patent office in Bern, finally overcame 
the difficulty. Einstein realized that the problem arose from a contradiction between 
the constancy of the speed of light, and the Galilean coordinate transformation. 
Rather than trying to modify electromagnetic theory, Einstein adopted a different 
approach. He rejected the Galilean transformation and set out to find a replacement 
for it—a new coordinate transformation that would solve the central problem of 
relativity. 


To ensure that the new coordinate transformation would be consistent with the 
constancy of the speed of light, Einstein built the constant c into his theory from the 
very beginning. He achieved this by basing his mathematical deductions on the 
following two postulates: 


Postulate I—The principle of relativity 
The laws of physics can be written in the same form in all inertial 
frames. 


Postulate II—The principle of the constancy of the speed of light 
The speed of light (in a vacuum) has the same constant value c in 
all inertial frames. 


The first postulate and its implications were discussed in Section 3.5. The second 
postulate is a more precise restatement of the conclusion we arrived at in Section 2.4. 
All that has been done is to replace the rather loose idea of measuring the speed of 
light relative to a detector by the more limited notion of measuring the speed of light 
relative to an inertial frame. If you wish, you can think of the inertial frame as the 
one in which the detector is at rest. 


A few simple mathematical steps, not far beyond the level of this Course, led Einstein 
from the postulates to the new coordinate transformation**, and thus to a new theory 
of physics. This new theory is now known as the special theory of relativity—special 
because it only deals with observations made in inertial frames. It does not, for 
example, say anything about the relationship between two frames undergoing 
relative acceleration, since two such frames cannot both be inertial. (Non-inertial 
frames are part of the subject-matter of another of Einstein’s theories: the general 
theory of relativity.) 


In 1905 Einstein published his discoveries in a classic paper entitled ‘On the
Electrodynamics of Moving Bodies’. The paper consisted of two parts. The first half, called
the kinematical part, included the derivation of the new coordinate transformation
together with an investigation of its implications for time and space. Many of those
implications are discussed in Sections 5.2 to 5.7 of this Unit. The second half of the
paper, called the electrodynamical part, presented an improved form of Maxwell's
theory of electromagnetism. It confirmed much of Maxwell’s work, but rewrote the
basic laws in such a way that they could be seen to take the same form in all inertial
frames when transformed with the aid of the new coordinate transformation. This
was a property that the original Maxwell equations did not have under the Galilean
transformation. Thus it was that Einstein solved the riddle of the speed of light, and
at the same time introduced a coordinate transformation that could be used in
conjunction with the principle of relativity to reassess all the existing ‘laws’ of physics.


* Quoted from The Evolution of Physics by A. Einstein and L. Infeld, published by Cambridge
University Press.


** A very simple, though rather lengthy, derivation of Einstein’s result can be found in Sir
Hermann Bondi’s book Relativity and Common Sense, published by Heinemann in 1974.
A more succinct derivation is given in Special Relativity by A. P. French, published by
Nelson in 1968.


5.2 The Lorentz transformation


Starting from the two postulates of special relativity, Einstein found the following
coordinate transformation linking the coordinates x, y, z and t assigned to an event
by an observer in inertial frame A, with the coordinates x', y', z' and t' assigned to
the same event by an observer in frame B, where A and B are in standard configuration
(see the fold-out page).

\[
\begin{aligned}
x' &= \frac{x - Vt}{\sqrt{1 - \dfrac{V^2}{c^2}}} \\
y' &= y \\
z' &= z \\
t' &= \frac{t - (xV/c^2)}{\sqrt{1 - \dfrac{V^2}{c^2}}}
\end{aligned}
\tag{16}
\]

This transformation is of such importance that it is given a special name—the Lorentz 
transformation. Although deduced by Einstein in the 1905 paper, it was first written 
down in 1895 by the Dutch physicist H. A. Lorentz in connection with his attempt 
to modify the laws of Maxwell’s electromagnetic theory. 
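

Equation 16 is straightforward to evaluate numerically. The Python sketch below
is an illustration, not part of the original text: the names `lorentz_factor` and
`lorentz_transform`, the rounded value c = 3 × 10⁸ m s⁻¹, and the sample event are
all assumptions chosen for this example.

```python
import math

C = 3.0e8  # speed of light in m/s (rounded value, an assumption of this sketch)


def lorentz_factor(V, c=C):
    """The factor 1 / sqrt(1 - V^2/c^2) that appears in equation 16."""
    return 1.0 / math.sqrt(1.0 - (V / c) ** 2)


def lorentz_transform(x, y, z, t, V, c=C):
    """Equation 16: coordinates in frame B of an event at (x, y, z, t) in frame A."""
    factor = lorentz_factor(V, c)
    return (factor * (x - V * t), y, z, factor * (t - x * V / c ** 2))


# A hypothetical event in frame A, chosen only for illustration.
event_a = (3.0e8, 0.0, 0.0, 2.0)  # x in metres, t in seconds
event_b = lorentz_transform(*event_a, V=0.6 * C)

print("event in frame A:", event_a)
print("event in frame B:", event_b)
```

Note that the time coordinate returned for frame B depends on x as well as on t,
which is the feature that distinguishes equation 16 most sharply from equation 10.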


You can easily see that the Lorentz transformation is very different from the Galilean 
transformation of Section 4.3. The new coordinate transformation contains c, the 
speed of light, a factor that was absent from the Galilean coordinate transformation. 
The presence of c is not surprising since the postulate of the constancy of the speed 
of light plays a fundamental role in special relativity. However, there are other 
differences. In particular, note the much greater complexity of the equation linking
t' and t, and the fact that √(1 − V²/c²) appears in two of the equations. What do
these changes mean? What are their physical implications, and how can they be 
tested experimentally? These are important questions that will occupy the rest of this 
Unit, but before attempting to answer them you should get to know the Lorentz 
transformation by trying the following SAQs: 


SAQ 12 Suppose that two inertial frames, A and B, are in standard
configuration with V = 3c/5. What are the coordinates in frame B of an event to
which an observer in frame A assigns the coordinates x = 18 × 10⁸ m,
y = 1 m, z = 0, t = 10 s?


SAQ 13 A rocket is fired from the origin of frame A at 45° to the x-axis, at
time t = 0. It travels in a straight line at a speed of √2 × 10⁸ m s⁻¹, and emits
a flash of light at times t = 1 s, t = 2 s and t = 3 s.


(a) What is the x-coordinate of the rocket’s position each time it emits a
flash of light?


(b) If the flight of the rocket is observed from a frame B that is in standard
configuration with frame A, with V = 3c/5, what are the t'- and x'-coordinates
of each flash?


SAQ 14 Given the coordinates (x, y, z, t) of an event according to an observer
in frame A, equation 16 can be used to work out the coordinates (x', y', z', t')
assigned to that same event by an observer in frame B. However, it would be
equally useful to be able to work out the coordinates that the observer in
frame A would assign to an event, if the coordinates of that event according to
the observer in frame B were given. By rearranging equation 16 show that

\[
\begin{aligned}
x &= \frac{x' + Vt'}{\sqrt{1 - \dfrac{V^2}{c^2}}} \\
y &= y' \\
z &= z' \\
t &= \frac{t' + (x'V/c^2)}{\sqrt{1 - \dfrac{V^2}{c^2}}}
\end{aligned}
\tag{17}
\]

The next five subsections explore five direct consequences of the Lorentz transforma- 
tion. Taken together they provide a new view of space and time. 


5.3 Special relativity at low speeds


In Section 4.3 you saw that the Galilean coordinate transformation led to results 
that agreed with many everyday notions about the way in which coordinates and 
velocities should transform between different frames of reference. However, you also 
saw that experiments with light showed that the Galilean coordinate transformation, 
despite its intuitive appeal, was not an acceptable solution to the central problem of 
relativity. Einstein claimed that the correct solution was given by the Lorentz 
transformation, which is obviously quite different from the Galilean coordinate 
transformation. If the Lorentz transformation is indeed the correct solution to the 
central problem of relativity, how does it explain your everyday observations, which 
seem to agree with the Galilean coordinate transformation? 


The answer is really quite simple. In everyday life you almost never encounter objects
that move at speeds greater than 10⁻⁵c relative to you. (10⁻⁵c is very roughly ten
times the speed of sound.) When dealing with coordinate transformations between
frames that have a relative speed of 10⁻⁵c or less, the Galilean coordinate
transformation and the Lorentz transformation are almost indistinguishable. In other words,
everyday experience involves speeds that are so small compared with the speed of light 
that the differences between the Galilean and Lorentz transformations are insignificant. 
When driving in your car, even if you break all the speed limits, you will never see 
any evidence that the correct solution to the central problem of relativity has to be 
the Lorentz transformation rather than the Galilean coordinate transformation. 


It is easy to see that the Lorentz and Galilean transformations are almost the same
at low speeds by simply substituting V = 10⁻⁵c into equation 16 and examining the
consequences. When V = 10⁻⁵c or less, the Lorentzian factor 1/√(1 − V²/c²) lies
between 1 and 1.00000000005. Under such conditions it is clearly reasonable to
make the following approximations to the Lorentz transformation

\[
x' = \frac{x - Vt}{\sqrt{1 - \dfrac{V^2}{c^2}}} \approx x - Vt
\tag{18}
\]

\[
y' = y
\tag{19}
\]

\[
z' = z
\tag{20}
\]

\[
t' = \frac{t - xV/c^2}{\sqrt{1 - \dfrac{V^2}{c^2}}} \approx t - \frac{xV}{c^2}
\tag{21}
\]

As you can see, in this low-speed approximation, equations 18, 19 and 20 immediately
take the form of the first three equations in the Galilean transformation (equation 10).
As for equation 21, the distance x can be thought of as a typical distance involved in
everyday observations, usually no more than a few kilometres. This implies that if
V is 10⁻⁵c or less, then the term xV/c² in equation 21 is negligible. So, for low speeds
(less than 10⁻⁵c) a good approximation to the Lorentz transformation is given by

\[
\begin{aligned}
x' &\approx x - Vt \\
y' &= y \\
z' &= z \\
t' &\approx t
\end{aligned}
\tag{22}
\]


Do not confuse this with the Lorentz
transformation given in equation 16.


This is just the Galilean coordinate transformation of equation 10. So, as far as the 
low-speed observations of everyday life are concerned, the Lorentz transformation 
and the Galilean coordinate transformation are indistinguishable. Thus, the Lorentz 
transformation also accords with the common experience of everyday life. 


In order to distinguish between the two kinds of coordinate transformation, it is
necessary to perform experiments that involve high speeds (e.g. experiments with
light itself). It is only in such experiments that the ubiquitous Lorentzian factor
1/√(1 − V²/c²) becomes significantly different from 1. Figure 27 shows the way in
which the Lorentzian factor increases as V is increased. You should be able to see that
1/√(1 − V²/c²) is only significantly different from 1 when V is an appreciable
fraction of the speed of light.
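

The behaviour of the Lorentzian factor can be tabulated directly. The sketch
below is illustrative only; the function name `lorentz_factor` and the particular
values of V/c in the loop are assumptions made here.

```python
import math


def lorentz_factor(beta):
    """Lorentzian factor 1 / sqrt(1 - V^2/c^2), where beta = V/c."""
    return 1.0 / math.sqrt(1.0 - beta ** 2)


# Tabulate the factor from an everyday speed up to speeds close to c.
for beta in (1e-5, 0.1, 0.5, 0.8, 0.9, 0.99):
    print(f"V/c = {beta:<7g} factor = {lorentz_factor(beta):.11f}")
```

At V = 10⁻⁵c the factor differs from 1 by about 5 parts in 10¹¹, and even at
V = c/10 the difference is only about half a per cent; by V = 4c/5 the factor has
risen to 5/3.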


5.4 The nature of time in special relativity


The Galilean concept of absolute time, summarized by the relation t = t', is totally 
rejected by special relativity. Time becomes a relative quantity, depending on the 
frame of the observer. This can be seen from the complicated relationship between t' 
and t in equation 16. The value of t’ not only depends on t but also on V and even 
on x. The relative nature of time explains why the event that was considered in SAQ 12 
did not occur at t = 10 s according to the observer in frame B. 


One of the most dramatic consequences of the relative nature of time is that the 
order in which two events occur can, in certain circumstances, depend on the frame 
of reference of the observer. To see that this effect does indeed follow from the Lorentz 
transformation, consider the following example. 


Suppose that an observer in frame A assigns the following coordinates to two events
E_1 and E_2.


E_1: x_1 = 12 × 10⁸ m, y_1 = 0, z_1 = 0, t_1 = 7 s

E_2: x_2 = 30 × 10⁸ m, y_2 = 0, z_2 = 0, t_2 = 11 s


It is quite clear that to someone using frame A, event E_1 occurs before E_2; it happens
at an earlier time.


To show that the order of the two events is not the same in all frames of reference let
us use the Lorentz transformation to work out the coordinates of E_1 and E_2 as seen
from a frame B in standard configuration with frame A. If the relative speed of A and
B is V = 4c/5, you can check for yourself, by using equation 16 (with c = 3 × 10⁸ m s⁻¹),
that according to an observer in frame B the coordinates of the two events are


E_1: x'_1 = −8 × 10⁸ m, y'_1 = 0, z'_1 = 0, t'_1 = 19/3 s

E_2: x'_2 = 6 × 10⁸ m, y'_2 = 0, z'_2 = 0, t'_2 = 5 s


So in frame B, event E_1 occurs after E_2: it is E_1 that happens at the later time. In
other words, the order of the two events in frame B is opposite to the order in which
they occurred in frame A.
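

The reversal of order can be checked with a short calculation. The following
Python sketch is an illustration under stated assumptions: the function name
`lorentz_transform` is introduced here, only the x and t coordinates are
transformed (y and z are zero for both events), and c is taken as the rounded
value 3 × 10⁸ m s⁻¹.

```python
import math

C = 3.0e8  # speed of light in m/s (rounded value, an assumption of this sketch)


def lorentz_transform(x, t, V, c=C):
    """x' and t' from equation 16 for an event on the x-axis of frame A."""
    factor = 1.0 / math.sqrt(1.0 - (V / c) ** 2)
    return factor * (x - V * t), factor * (t - x * V / c ** 2)


V = 0.8 * C  # relative speed of frames A and B, V = 4c/5

# Coordinates of the two events in frame A: (x in metres, t in seconds).
x1_a, t1_a = 12e8, 7.0
x2_a, t2_a = 30e8, 11.0

x1_b, t1_b = lorentz_transform(x1_a, t1_a, V)
x2_b, t2_b = lorentz_transform(x2_a, t2_a, V)

print(f"E_1 in B: x' = {x1_b:.3e} m, t' = {t1_b:.3f} s")
print(f"E_2 in B: x' = {x2_b:.3e} m, t' = {t2_b:.3f} s")
print("E_1 before E_2 in frame A:", t1_a < t2_a)
print("E_1 before E_2 in frame B:", t1_b < t2_b)
```

Trying other values of V in this sketch shows that the order in frame B is not
reversed for every relative speed; it depends on how V compares with the
separations of the two events in space and time.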


This change in ordering is potentially very disturbing. Think of what would happen 
if event E_1 represented pushing the plunger on a detonator, and E_2 was the ensuing
explosion. While an observer in frame A would see the detonation take place after 
the plunger had been pressed, an observer in frame B would see the explosion take 
place before the plunger had been pressed! 


Fortunately, such a situation can never arise. If you look at the coordinates of E_1
and E_2 in frame A, you will see that the distance between them, x_2 − x_1, is greater
than the distance that light can travel during the period of time that separates them,
c(t_2 − t_1). In mathematical terms

\[
x_2 - x_1 > c(t_2 - t_1)
\]

An analogous statement is true in frame B, in which the spatial separation of the two
events is x'_2 − x'_1 and the distance that light can travel during the period of time that
separates the two events is c(t'_1 − t'_2).
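

Both inequalities can be confirmed by direct arithmetic. The sketch below is
illustrative: the helper name `light_could_link` is an assumption made here, c is
taken as 3 × 10⁸ m s⁻¹, and the separations are formed from the coordinates of
E_1 and E_2 in frames A and B (x_1 = 12 × 10⁸ m, t_1 = 7 s, x_2 = 30 × 10⁸ m,
t_2 = 11 s in A; x'_1 = −8 × 10⁸ m, t'_1 = 19/3 s, x'_2 = 6 × 10⁸ m, t'_2 = 5 s in B).

```python
C = 3.0e8  # speed of light in m/s (rounded value, an assumption of this sketch)


def light_could_link(dx, dt, c=C):
    """True if a signal no faster than light could cover distance dx in time dt."""
    return abs(dx) <= c * abs(dt)


# Separations of E_1 and E_2 in space and time, in frame A and in frame B.
dx_a, dt_a = 30e8 - 12e8, 11.0 - 7.0
dx_b, dt_b = 6e8 - (-8e8), 5.0 - 19.0 / 3.0

print("frame A: distance", dx_a, "m; light travels", C * abs(dt_a), "m")
print("frame B: distance", dx_b, "m; light travels", C * abs(dt_b), "m")
print("linkable in A:", light_could_link(dx_a, dt_a))
print("linkable in B:", light_could_link(dx_b, dt_b))
```

In each frame the spatial separation exceeds the distance light could cover in
the time available, so the test fails in both.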


Provided that nothing travels faster than light, it is not possible, in either frame, for
the two events to be connected by any kind of signal that travels from E_1 to E_2. Such
a signal simply would not be able to travel fast enough to link the two events in either
frame. Therefore, provided that nothing travels faster than light, E_1 cannot have
been the cause of E_2 in frame A, nor can E_2 have been the cause of E_1 in frame B. In
fact, it can be shown that, provided V is less than c, and provided no signal travels
faster than light, there does not exist any frame B in which E_1 could have caused E_2
or vice versa. This being the case, the re-ordering of events E_1 and E_2 does not lead
to any physical paradox, since the two events must be completely independent in all
frames if they are independent in frame A. In order to ensure that there are no
paradoxical situations in which effects precede their causes we will henceforth assume
that the speed of light is the maximum speed at which a signal can travel.*


Figure 27 The Lorentzian factor
1/√(1 − V²/c²) plotted against V/c.


If the separation of the two events E_1 and E_2 had been such that a signal, travelling
at the speed of light or slower, could have passed between them, then one could have
caused the other. In such cases the Lorentz transformation can be used to show that
although observers in different frames may assign different coordinates (space and
time) to each of the events, they all agree on the order in which events happen. (This
assumes that no observer travels faster than light relative to another observer, but
that agrees with our assumption that no signal travels faster than light.) So if E_1
had been able to cause E_2, no observer, no matter how fast he travelled (provided it
was less than c) would observe E_2 to happen before E_1. Special relativity does not
allow an explosion to occur before the pressing of the plunger that caused it.


Another interesting, though less spectacular, consequence of the relative nature of
time is known as the relativity of simultaneity. This means that two events which are
simultaneous according to one observer (i.e. happening at the same time in his
frame) may occur at two different times according to another observer, provided he is
moving relative to the first.


Einstein used the relativity of simultaneity to provide a convincing demonstration of
the relative nature of time. Think back to the example of a train passing through a
railway station, used to introduce the idea of standard configuration in Section 4.2.
Figure 28 shows an observer A standing on a station platform watching a passing
carriage. Another observer B is positioned at the centre of the carriage. Two lightning
bolts strike the ends of the carriage and produce two flashes of light, just as the
midpoint of the carriage is passing A. Since the light flashes originate at equal
distances from A, he sees the two lightning bolts strike simultaneously. In fact, both
lightning bolts struck at the same time in his frame; they were simultaneous in his
frame. Now, observer B is moving towards the flash that originates at point Y, and
away from the flash at point X. So, allowing for the finite speed of light, the Y flash
will reach B before the X flash. However, B knows that all light signals travel at the
same speed c, so being equidistant from the two ends of the carriage he interprets the
fact that the flash from Y reaches him first as meaning that a lightning bolt struck Y
at an earlier time than a lightning bolt struck X. So, according to B the two lightning
bolts did not strike his carriage at the same time; they were not simultaneous in his
frame. Thus, two events that are simultaneous in the frame of observer A are not
necessarily simultaneous in the frame of observer B. Simultaneity is a relative concept.


Figure 28 An observer A sees two
lightning bolts strike the endpoints X and
Y of a carriage moving past. The sole
occupant of the carriage is an observer B
at its centre.
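

The same conclusion follows from the time equation of the Lorentz transformation
(equation 16). The sketch below is an illustration with hypothetical numbers: the
carriage length of 20 m as measured in frame A, the speed 3c/5, the rounded value
of c, and the function name `time_in_b` are all assumptions made here. Frame A is
the platform, frame B is the carriage, and Y is taken to be the leading end.

```python
import math

C = 3.0e8  # speed of light in m/s (rounded value, an assumption of this sketch)


def time_in_b(x, t, V, c=C):
    """Time coordinate t' in frame B, from equation 16."""
    factor = 1.0 / math.sqrt(1.0 - (V / c) ** 2)
    return factor * (t - x * V / c ** 2)


V = 0.6 * C          # speed of the carriage (frame B) along the x-axis of frame A
length_in_a = 20.0   # hypothetical length of the moving carriage, measured in frame A

# Both strikes occur at t = 0 in frame A, when the midpoint of the carriage is at x = 0.
t_x = time_in_b(-length_in_a / 2, 0.0, V)  # strike at the trailing end X
t_y = time_in_b(+length_in_a / 2, 0.0, V)  # strike at the leading end Y

print("t' of strike at X:", t_x, "s")
print("t' of strike at Y:", t_y, "s")
print("Y struck before X in frame B:", t_y < t_x)
```

For a carriage of everyday size the difference between the two times is only a
few hundredths of a microsecond, even at this very high speed, which is why the
effect is never noticed in ordinary life.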


* Hypothetical particles which travel faster than light have been sought experimentally. No 
evidence has been found to show that such particles—called tachyons—actually exist. But 
even if they are discovered in the future, it is possible that they might have special properties 
which would prevent them from leading to any paradox of the plunger/explosion kind. 


SAQ 15 According to an observer in frame A, two events e_1 and e_2 have
different positions but occur at the same time. Their coordinates are


e_1: x_1 = 12 × 10⁸ m, y_1 = 0, z_1 = 0, t_1 = 7 s

e_2: x_2 = 18 × 10⁸ m, y_2 = 0, z_2 = 0, t_2 = 7 s


(a) What are the time coordinates of these two events according to an 
observer in a frame B that is in standard configuration with A and moves with 
speed V = 4c/5 along the x-axis of A? 


(b) Are the two events simultaneous according to the observer in B? 


SAQ 16 Using the coordinates of the events $e_1$ and $e_2$ in frame A, as given
in SAQ 15, determine whether or not it is possible to find a frame D in which 
event e, could have caused event e,. (You should assume that D is in standard 
configuration with A, with V less than c.) 


5.5 The velocity transformation of special relativity


The Galilean coordinate transformation failed to solve the central problem of 
relativity because it led to incorrect predictions about the way in which two different 
observers would measure the velocity of an object. In particular, you saw in Section 
4.3 that if the Galilean transformation were correct then, contrary to experiment, the 
speed of light would not be the same in all inertial frames. The way in which Einstein 
derived the Lorentz transformation, using the postulate of the constancy of the speed 
of light, ensures that special relativity will not suffer from this problem. In this Section 
we will deduce the rule for transforming velocity components between different frames 
of reference, and see exactly how special relativity guarantees that any light beam 
will have the same speed relative to any observer in an inertial frame. 


Figure 29 Two points on the trajectory of a particle moving along the x-axis of
frame A (and the x'-axis of frame B).

Consider the usual two frames, A and B, in standard configuration, as shown on the
fold-out page. Imagine that a particle is travelling along the x-axis of frame A and
passes through the point $x_1$ at time $t_1$, and through $x_2$ at time $t_2$ (Figure 29). If the
two points are very close together (consider the limit as their separation vanishes)
then according to an observer in frame A, the x-component of the instantaneous
velocity of the particle is

$$v_x = \frac{x_2 - x_1}{t_2 - t_1} \qquad (23)$$

An observer in frame B watching the same particle move between the same two points
would describe its motion using his coordinate system in which the points are given
the coordinates $x'_1$ at $t'_1$ and $x'_2$ at $t'_2$. According to the observer in B the x'-component
of the velocity of the particle is

$$v'_x = \frac{x'_2 - x'_1}{t'_2 - t'_1} \qquad (24)$$

Our objective is to use the Lorentz transformation to work out a relationship between
$v_x$ and $v'_x$; a velocity transformation that will enable us to take the components of the
velocity of the particle as observed in frame A, and work out the components of the
particle's velocity according to an observer in frame B. Using the Lorentz transformation
of equation 16, it is easy to see that:

$$x'_1 = \frac{x_1 - Vt_1}{\sqrt{1 - \dfrac{V^2}{c^2}}}, \qquad t'_1 = \frac{t_1 - (x_1 V/c^2)}{\sqrt{1 - \dfrac{V^2}{c^2}}}$$

$$x'_2 = \frac{x_2 - Vt_2}{\sqrt{1 - \dfrac{V^2}{c^2}}}, \qquad t'_2 = \frac{t_2 - (x_2 V/c^2)}{\sqrt{1 - \dfrac{V^2}{c^2}}} \qquad (25)$$

Substituting these four pieces of information into equation 24 gives

$$v'_x = \frac{(x_2 - Vt_2) - (x_1 - Vt_1)}{(t_2 - (x_2 V/c^2)) - (t_1 - (x_1 V/c^2))}$$

i.e.

$$v'_x = \frac{(x_2 - x_1) - V(t_2 - t_1)}{(t_2 - t_1) - \dfrac{V}{c^2}(x_2 - x_1)} \qquad (26)$$


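If both the numerator and the denominator of this last equation are divided by a
factor of $(t_2 - t_1)$, we get

$$v'_x = \frac{\dfrac{x_2 - x_1}{t_2 - t_1} - V\,\dfrac{t_2 - t_1}{t_2 - t_1}}{\dfrac{t_2 - t_1}{t_2 - t_1} - \dfrac{V}{c^2}\,\dfrac{x_2 - x_1}{t_2 - t_1}} \qquad (27)$$

By using equation 23 this can be rewritten as

$$v'_x = \frac{v_x - V}{1 - \dfrac{V v_x}{c^2}} \qquad (28)$$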


This is exactly the rule we set out to look for. It tells us that if an observer in A sees
some object that moves with velocity component $v_x$ along the x-axis, then it is an
inevitable consequence of special relativity that an observer in B will find that the
velocity component $v'_x$ of that object along the x'-axis is exactly that given by
equation 28.


ITQ 7 (Optional) By using a method similar to that employed in the
derivation of equation 28, it is possible to obtain the transformation law for the
velocity components $v_y$ and $v_z$. The only difference is that the derivation
begins by supposing that the particle is travelling in an arbitrary direction in
frame A with velocity $(v_x, v_y, v_z)$. Using this method, show that

$$v'_y = \frac{v_y \sqrt{1 - \dfrac{V^2}{c^2}}}{1 - \dfrac{V v_x}{c^2}} \qquad (29)$$

and

$$v'_z = \frac{v_z \sqrt{1 - \dfrac{V^2}{c^2}}}{1 - \dfrac{V v_x}{c^2}} \qquad (30)$$


Equations 28, 29 and 30 make up the velocity transformation rule of special relativity. 
Unlike the Galilean velocity transformation, it has the property that anything that 
moves with speed c in one inertial frame will also move with speed c in any other 
inertial frame. A simple example will show how this remarkable property comes 
about. 


Suppose that in an inertial frame A a light signal is observed travelling along the
x-axis. Naturally, such a signal has velocity components $v_x = c$, $v_y = 0$, $v_z = 0$.
To work out the velocity components of the light signal in frame B, which is in
standard configuration with A, all we have to do is to use the velocity transformation
equations (equations 28, 29 and 30), which give

$$v'_x = \frac{c - V}{1 - \dfrac{Vc}{c^2}} = \frac{c - V}{\dfrac{1}{c}(c - V)} = c$$

$$v'_y = 0$$

$$v'_z = 0$$




Thus the speed of the light signal in frame B is still c, despite the fact that frame B is 
moving along the x-axis of frame A with speed V. Of course, this is just the result you 
should have been expecting. Einstein introduced the Lorentz transformation precisely 
because he knew that it implied that the speed of light has the same constant value c 
in all inertial frames. 
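
The same property can be checked for a light signal that does not travel along the x-axis. The sketch below is an illustration rather than part of the course text: it codes equations 28, 29 and 30 directly, and the function names, the relative speed of 0.8c and the 60 degree angle are all assumptions made for the example.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def transform_velocity(vx, vy, vz, V):
    """Velocity components in frame B, given those in frame A.

    Implements equations 28, 29 and 30 for frames in standard
    configuration, with B moving at speed V along the x-axis of A.
    """
    denominator = 1.0 - V * vx / C**2
    root = math.sqrt(1.0 - (V / C) ** 2)
    return ((vx - V) / denominator,
            vy * root / denominator,
            vz * root / denominator)


def speed(vx, vy, vz):
    """Magnitude of a velocity with the given components."""
    return math.sqrt(vx**2 + vy**2 + vz**2)


V = 0.8 * C  # hypothetical relative speed of the two frames

# A light signal travelling along the x-axis of frame A.
along_x = transform_velocity(C, 0.0, 0.0, V)

# A light signal travelling at 60 degrees to the x-axis in the xy-plane.
angle = math.radians(60.0)
oblique = transform_velocity(C * math.cos(angle), C * math.sin(angle), 0.0, V)

print("speed in B / c (along x):", speed(*along_x) / C)
print("speed in B / c (oblique):", speed(*oblique) / C)
```

The direction of the oblique signal is different in frame B, but in both cases the speed in B equals c to within rounding error.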


Before leaving the subject of the velocity transformation of special relativity, it is
interesting to note the following fact. If both V and $v_x$ are much less than the speed
of light, then the term $(V v_x/c^2)$ in equation 28 may be neglected, and we recover the
Galilean rule that everyday experience confirms:

$$v'_x = v_x - V \qquad \text{(eq. 11)}$$


In fact we used this relationship in our discussion of cars at the beginning of Section 
2.1. Now you can see that we were only using a good approximation to the correct 
law for working out the relative speed of two cars. This is just another example of the 
way in which low-speed special relativity is indistinguishable from Galilean relativity. 
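
To get a feel for how good the approximation is, the two rules can be compared numerically. This is a rough sketch only; the car speeds and the high-speed values are invented for illustration, and the function names are hypothetical.

```python
C = 299_792_458.0  # speed of light in m/s


def relative_velocity_relativistic(vx, V):
    """Equation 28: x-component of velocity in frame B."""
    return (vx - V) / (1.0 - V * vx / C**2)


def relative_velocity_galilean(vx, V):
    """Equation 11: the Galilean rule."""
    return vx - V


# Hypothetical everyday case: two cars at 30 m/s and 20 m/s.
car_exact = relative_velocity_relativistic(30.0, 20.0)
car_approx = relative_velocity_galilean(30.0, 20.0)
car_fractional_error = abs(car_exact - car_approx) / car_approx

# Hypothetical high-speed case: 0.9c as seen from a frame moving at 0.5c.
fast_exact = relative_velocity_relativistic(0.9 * C, 0.5 * C)
fast_approx = relative_velocity_galilean(0.9 * C, 0.5 * C)

print("cars, fractional error of Galilean rule:", car_fractional_error)
print("fast case, relativistic result / c:", fast_exact / C)
print("fast case, Galilean result / c:", fast_approx / C)
```

For the cars the two rules agree to far better than any practical measurement could detect, whereas in the high-speed case they differ substantially.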


SAQ 17 Consider a relativistic gnat, flying directly away from the mouth of 
a hungry frog at half the speed of light. Suppose that the frog extends its tongue 
at three quarters of the speed of light. If the gnat cared to measure the speed 
of the frog’s tongue, what value would it obtain? 


SAQ 18 Suppose that the gnat of SAQ 17 is flying directly towards the 
frog’s mouth at half the speed of light. Once again, suppose that the frog 
extends its tongue at three quarters the speed of light towards the gnat. If the 
gnat measured the speed of approach of the tongue, what result would it 
obtain? (Hint: remember that velocity components can be positive or negative 
according to their direction.) 


5.6 Lorentz contraction


One of the most widely known predictions of special relativity is that moving rods 
contract along the direction of their motion. However, there is an almost equally 
widespread misunderstanding of that statement. This Section is intended to ensure 
that you have a clear understanding of the effect of motion on measurements of 
length. 


In everyday life, when asked to measure the length of an object, we are usually only 
expected to deal with an object that is stationary. For such objects—those at rest 
relative to the observer—we can safely say that the length is just the distance between 
the two ends. The problem of measuring the length of a rapidly moving object is more 
difficult. A very sensible definition of length is adopted in special relativity, one that 
is compatible with our normal use of the word and yet one that can be applied to 
bodies moving in a given frame of reference. The length of an object in a given frame
is defined to be the distance between the two events that mark its endpoints at a single 
time in that frame. An example should help to clarify this definition. 


Consider the following method for determining the length of a rod moving with 
speed V along the x-axis of an inertial frame A. A large number of men are lined up 
along the x-axis of A. Each man knows his x-coordinate, which is fixed, and has a 
clock that has been synchronized with all the other clocks by means of the light signal 
procedure discussed in Section 3.2. The men have all agreed that they will measure 
the length of the moving rod at a specific time, t = 10s say, in their frame of 
reference—frame A. The plan is that when each man’s clock shows t = 10s, he will 
look at the point on the x-axis at which he is located, and if one of the endpoints of 
the rod is passing him he will raise his arm. Figure 30 shows the outcome of the 
measurement. As you would expect, only two men have their arms raised; after all, 
the rod only has two ends, and they must both be somewhere at t = 10s. According 
to the definition of the length of a moving object, the length of the moving rod as 
measured in frame A is simply the distance between the two men who raised their 
arms, i.e. the difference in their x-coordinates. 


Of course, we would not have gone to all this trouble to define the length of a moving 
rod if there had not been something peculiar (i.e. non-Galilean) about it. The fact 
of the matter is that the length that is found by the method just described depends on 
the speed of the rod relative to those performing the measurement. In other words, 
length, like time, speed and velocity, is a relative quantity: it depends on the frame
of reference of the observer. Let us now prove that the relative nature of length is a
direct consequence of the Lorentz transformation. 


Figure 31 shows a rod fixed along the x'-axis of an inertial frame B, which is in
standard configuration with frame A. When viewed from A, such a rod is seen to
move along the x-axis with speed V, just like the one we discussed before. Suppose
that one end of the rod is fixed at $x'_1 = 0$ and the other is fixed at $x'_2 = L_0$. (By fixed
we mean it is at that position for all values of t' according to an observer in B.) Any
attempt to measure the length of the rod in B, the frame in which it is at rest, will yield
the result $x'_2 - x'_1 = L_0$.


Figure 30 When their clocks show t = 10s, the two men who find themselves
at the ends of the moving rod raise their arms. The length of the moving rod in
frame A is just the distance L between the two men who have raised their arms,
L being measured in frame A.

Figure 31 A rod is fixed between the points $x'_1 = 0$ and $x'_2 = L_0$ in frame B,
which is in standard configuration with frame A. When viewed from frame A,
the rod moves along the x-axis with speed V.

In order to work out the length L of the rod as measured in frame A, we need to know
the positions of both its endpoints at one particular time in frame A. Note that the
phrase ‘one particular time’ now means at a single value of t, since it is a measurement
in frame A that interests us. We are free to choose any value of t we want; the length
of the rod does not depend on when we make the measurement. Let us take t = 0 for
simplicity, though any other value of t would do just as well. Substituting t = 0 into
equation 16 shows that, according to the Lorentz transformation, the x and x'
coordinates of an event that occurs at time t = 0 are related by

$$x' = \frac{x}{\sqrt{1 - \dfrac{V^2}{c^2}}}$$

i.e.

$$x = x' \sqrt{1 - \frac{V^2}{c^2}} \qquad (31)$$

This equation can be used to work out the x-coordinates of the two endpoints at
t = 0:

For the endpoint at $x'_1 = 0$, we get $x_1 = 0$.

For the endpoint at $x'_2 = L_0$, we get $x_2 = L_0 \sqrt{1 - V^2/c^2}$.

We now know the x-coordinates ($x_1$ and $x_2$) of both the endpoints at a single time
in frame A. Because we have defined the distance between the two endpoints (as
measured in A) at a single time t to be the length L of the rod as measured in A, we
can conclude that

$$L = L_0 \sqrt{1 - \frac{V^2}{c^2}} \qquad (32)$$


Thus, since $\sqrt{1 - V^2/c^2}$ is less than 1, it follows from equation 32 that L is less than
$L_0$. In other words, the measured length of the rod is smaller in frame A, where the
rod moves, than in B where the rod is at rest.
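
The measurement procedure can be imitated numerically by locating both endpoints at a single time in frame A. The sketch below is illustrative only. It assumes the position part of the Lorentz transformation, solved for x, and it uses an invented rest length and speed; only the measurement time t = 10 s is borrowed from the earlier example.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def position_in_a(x_prime, t, V):
    """x-coordinate in frame A, at time t in A, of a point fixed at x_prime in B.

    Obtained by solving x' = (x - V t) / sqrt(1 - V^2/c^2) for x.
    """
    return x_prime * math.sqrt(1.0 - (V / C) ** 2) + V * t


L0 = 1.0      # rest length of the rod in metres (hypothetical)
V = 0.6 * C   # speed of the rod relative to frame A (hypothetical)
t = 10.0      # the agreed measurement time in frame A, in seconds

x1 = position_in_a(0.0, t, V)   # endpoint fixed at x' = 0
x2 = position_in_a(L0, t, V)    # endpoint fixed at x' = L0
L = x2 - x1                     # length as measured in frame A

# Both positions are of order 1e9 m, so the difference carries a
# rounding error of a few times 1e-7 m in double precision.
print("measured length L:", round(L, 5), "m")
print("equation 32 gives:", L0 * math.sqrt(1.0 - (V / C) ** 2), "m")
```

Repeating the calculation with a different value of t moves both endpoints by the same amount Vt, so the measured length is unchanged.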


This effect is known as the Lorentz contraction, and it is the origin of the much looser 
statement ‘moving rods contract along the direction of their motion’. You should 
always remember that the Lorentz contraction is a result of the relative motion of an 
object and the observer who is measuring its length. If an observer were to travel with 
the rod in frame B, he would certainly not observe any contraction in its length. The 
rod would not be moving relative to the observer, so it would always have length $L_0$
according to him. On the other hand, if an observer moving with the rod in frame B
measured the length of a rod that was at rest in frame A, he would observe a length
contraction because a rod at rest in frame A would be moving relative to an observer 
in frame B. There is a beautiful symmetry between the two frames. Any observer finds 
that a rod moving relative to his frame is contracted. 


One common misconception must be cleared up before we leave the subject of con- 
tracting rods. It is often said that the moving rod appears to be contracted. This 
statement is much too imprecise. The use of the word ‘appears’ gives the impression 
that the Lorentz contraction is a kind of optical illusion, and that is certainly not the 
case. In relativity we must rely on measurements—only measurements are physically 
meaningful. We defined what we meant by a ‘measured length’ and that certainly
depends on the speed of the rod relative to the observer. The contraction is a physical 
effect concerning measured lengths, and not just an optical phenomenon.* 


SAQ 19 Cosmic rays are highly energetic elementary particles (many are 
protons) that bombard the Earth from outer space. Suppose that a cosmic ray 
travelling directly towards the Earth with speed 0.996c is observed by a detector 
on a balloon at an altitude of 10 000 m. What is the distance between the 
balloon and the ground as measured in the frame of reference in which the 
cosmic ray is at rest? 


5.7 Time dilation


Just as moving rods contract, so, it is often said, ‘moving clocks run slow’. Once 
again, this is a rather loose statement and it needs clarification if it is to be properly 
interpreted. 


Figure 32 shows a row of identical synchronized clocks, lined up along the x-axis of 
an inertial frame A. The clocks have been synchronized by means of the light signal 
procedure described in Section 3.2, and they all show the value of t, the time according 
to an observer in frame A. Another clock, identical to those in frame A, is fixed at the 
origin of an inertial frame B that is in standard configuration with frame A. The clock
at the origin of B shows the time t', the time coordinate belonging to frame B. Because
frames A and B are in standard configuration, the clock at the origin of B will move
along the x-axis of frame A with constant speed V, successively passing each of the
clocks fixed in frame A. Another consequence of the standard configuration of A
and B is that when the clock at the origin of B shows t' = 0, it will be passing the
clock at the origin of A, which will be showing t = 0. (If you need to remind yourself
of the definition of standard configuration, consult the fold-out page.)


* It is quite easy to show that if a photograph is taken of a rod moving rapidly towards a
camera the rod will actually appear to be longer than it really is (i.e. longer than its measured
length). This is an optical effect and it is quite independent of the Lorentz contraction of the
measured length of the rod.


We will now use the arrangement of clocks shown in Figure 32 to demonstrate that 
intervals of time are relative quantities—their duration depends on the motion of the 
observer who measures them. It is quite easy to see how this can be done. Look at the 
clock in frame B in Figure 32; according to an observer in frame A it takes a time T 
for the clock in B to travel from x = 0 to x = VT. How long does this journey take 
according to an observer in frame B? Well, we know that when the clock in frame B 
was at the point x = 0 it showed that the time was t’ = 0. So, if we want to know how 
long it took, according to an observer in frame B, for the clock in B to move from 
x = 0 to x = VT, all we have to do is work out the reading on the clock in frame B 
when it reaches the position shown in Figure 32. Inevitably, in order to do this we 
must use the Lorentz transformation given in equation 16. We will have to use the 
last part of the Lorentz transformation, the equation that relates the t- and t'-coordinates
of an event.

$$t' = \frac{t - (xV/c^2)}{\sqrt{1 - \dfrac{V^2}{c^2}}} \qquad (33)$$


The event in question is the coincidence of the clock at x' = 0 in frame B with the
clock at x = VT in frame A, as shown in Figure 32. We know that according to an
observer in frame A the event occurs at t = T, and x = VT. By substituting this
information into equation 33 we can work out the t'-coordinate of the coincidence
event according to an observer in frame B. In other words, we can work out the
reading T' shown on the clock in frame B.


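$$T' = \frac{T - (TV^2/c^2)}{\sqrt{1 - \dfrac{V^2}{c^2}}} = \frac{T\left(1 - \dfrac{V^2}{c^2}\right)}{\sqrt{1 - \dfrac{V^2}{c^2}}}$$

i.e.

$$T' = T \sqrt{1 - \frac{V^2}{c^2}} \qquad (34)$$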


Because $\sqrt{1 - (V^2/c^2)}$ is less than 1, the reading T' on the clock in frame B is less
than the reading T on the corresponding clock in frame A. In other words, according
to an observer in frame B, the amount of time required for the clock in frame B to
travel from x = 0 to x = VT is less than the time required for the same trip according
to an observer in frame A. So intervals of time are indeed relative: their duration
does depend upon the frame in which they are measured.

Figure 32 A clock fixed at the origin of frame B moves along the x-axis of frame
A with speed V. At time t = T in frame A, the clock in frame B has reached the
point x = VT.


Another way of expressing this result comes from noting that all of the clocks in 
frame A are synchronized, and that the clock in frame B agreed with one of the clocks 
in frame A at t = t' = 0. The fact that T' is less than T forces us to conclude that
the moving clock is running more slowly than the clocks in A. This tendency of a 
moving clock to run slow is called the time dilation effect. 
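
The reading T' can also be obtained by applying equation 33 directly to the coincidence event and comparing the result with equation 34. The following is a minimal sketch under assumed values: the speed 0.8c and the interval T = 5 s are chosen only to give round numbers, and the function name is hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def time_in_b(t, x, V):
    """Equation 33: the t'-coordinate of an event at (t, x) in frame A."""
    return (t - x * V / C**2) / math.sqrt(1.0 - (V / C) ** 2)


V = 0.8 * C   # hypothetical speed of frame B relative to frame A
T = 5.0       # hypothetical reading, in seconds, on the A clock at x = VT

# The coincidence event has coordinates t = T, x = VT in frame A.
T_prime = time_in_b(T, V * T, V)

# Equation 34 should give the same reading.
T_prime_eq34 = T * math.sqrt(1.0 - (V / C) ** 2)

print("T' from equation 33:", T_prime, "s")
print("T' from equation 34:", T_prime_eq34, "s")
```

With these values the factor $\sqrt{1 - V^2/c^2}$ is 0.6, so the clock in frame B shows about 3 s when the clock it is passing in frame A shows 5 s.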


It is important to realize that time dilation is not some kind of mechanical defect 
that affects moving clocks. It is a direct consequence of the Lorentz transformation, 
and it represents a slowing down of time itself in frame B relative to frame A. The 
clock is simply a device that we use to make this clear. If an observer had travelled 
with the clock in frame B he would not have seen anything strange happen to the 
clock. The hands would not have appeared to move more slowly than usual. The 
dilation is a property of time itself; both the clock and the rate of all the life processes 
of the observer would slow down relative to the clocks in frame A. 


Now think about the following problem. In describing the time dilation effect we 
have sometimes used the term ‘moving clock’ to mean the clock in frame B. But 
surely this is rather biased—as all speed is relative would it not be perfectly fair to 
reverse the arguments given above and treat the clock in B as the stationary clock 
while the row of clocks in A move with constant speed? Furthermore, if we did think 
of things in this way, would not the Lorentz transformation lead us to expect that 
it was the clocks in A that were running slow? 


The answer to both of these questions is yes! Speed is relative, so we can think of A 
as ‘moving’, and we would expect that the clocks in A run slow when observed from 
B. However, amazing as it may seem, this does not conflict with the fact that T’ is, 
as was shown above, definitely less than T. The reason is very interesting and shows 
the care that is sometimes needed when dealing with special relativity. We know that 
observers in frame A synchronize their clocks by exchanging light signals. Any
observer in A would be quite convinced that all the clocks in A were properly
synchronized at all times, since they are all at rest relative to each other. However, an
observer in B would not agree that the clocks in A had ever been properly synchronized.
He would believe that they had all been started out of step! To convince yourself of
this, all you have to do is ask whether the clocks fixed at x = 0 and x = X in frame A
show identical values of t, when viewed by an observer in frame B at a given value of
t' (Figure 33). This amounts to asking whether a line of clocks that are synchronized
in frame A will also be synchronized in frame B. The fact that the answer is no follows
from a simple application of equation 33. At t' = 0, the clock at rest at x = 0 shows
that the time is t = 0. On the other hand, by rearranging equation 33 it is easy to see
that at t' = 0 the clock fixed at x = X shows that the time is $t = XV/c^2$.
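
The rearrangement can be confirmed by substituting the claimed reading back into equation 33. The sketch below is illustrative only; the speed and the clock positions are invented, and the function names are hypothetical.

```python
import math

C = 299_792_458.0  # speed of light in m/s


def time_in_b(t, x, V):
    """Equation 33: the t'-coordinate of an event at (t, x) in frame A."""
    return (t - x * V / C**2) / math.sqrt(1.0 - (V / C) ** 2)


def reading_when_t_prime_is_zero(X, V):
    """Reading t on the A clock fixed at x = X at the instant t' = 0 in B."""
    return X * V / C**2


V = 0.6 * C                              # hypothetical relative speed
positions = [0.0, 1.0e8, 2.0e8, 3.0e8]   # hypothetical clock positions in metres

readings = [reading_when_t_prime_is_zero(X, V) for X in positions]

for X, t in zip(positions, readings):
    # Each (t, X) pair should correspond to t' = 0 in frame B.
    print(f"x = {X:.1e} m: clock reads t = {t:.4f} s, t' = {time_in_b(t, X, V)}")
```

The readings grow in proportion to X, so at the single instant t' = 0 in frame B the clocks of frame A show a spread of different times.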


Figure 33 At t' = 0, the origins of frame A and frame B coincide and the
clock at the origin of frame A shows t = 0. But what is the reading on the
clock at x = X? (This Figure has been drawn from the point of view of an
observer in frame B. From such a viewpoint, frame A is moving in the
negative x'-direction with speed V.)

Figure 34 The arrangement shown in Figure 32, but now drawn from the
viewpoint of an observer in frame B at time t' = T'. Frame A is the ‘moving’
frame and the clocks along the x-axis of frame A are not synchronized according
to the observer in B. Thus, although the clock fixed at x = VT shows a reading T
which is greater than T', it is still the case that the clocks in frame A are running
slow relative to the clock in frame B.

Thus, when viewed from frame B, the clocks in frame A are not synchronized. The
situation shown in Figure 32 is redrawn from the point of view of the observer in
frame B in Figure 34. According to the observer in B, each of the clocks fixed in
frame A is running slow compared with his own clock. But, in addition, each of the
clocks on the x-axis of frame A is out of synchronization with all of the others. The
observer in B is therefore not surprised that the reading T is greater than T'. Even
though he believes that the clock fixed at x = VT in frame A runs more slowly than
his own clock, he also believes that the clock at x = VT had a head start compared
with his own clock. In special relativity, all observers agree that a moving clock runs
slow.
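
The bookkeeping of the observer in B can be made explicit with numbers. The sketch below is an illustration under assumed values (V = 0.8c and T = 5 s are chosen only for convenience). It adds the head start of the clock at x = VT to the amount by which that clock advances, at its slowed rate, during the interval T' measured in B.

```python
import math

C = 299_792_458.0  # speed of light in m/s

V = 0.8 * C   # hypothetical relative speed of the frames
T = 5.0       # hypothetical reading on the A clock at x = VT at the coincidence

root = math.sqrt(1.0 - (V / C) ** 2)

# Reading on the clock in frame B at the coincidence (equation 34).
T_prime = T * root

# Head start: reading of the A clock at x = VT at the instant t' = 0,
# found by putting X = VT into t = XV/c^2.
head_start = (V * T) * V / C**2

# According to B, the A clock runs slow by the same factor, so during
# the interval T' it advances by only T' * sqrt(1 - V^2/c^2).
advance = T_prime * root

reconstructed_T = head_start + advance

print("head start:", head_start, "s")
print("advance during the journey:", advance, "s")
print("predicted reading:", reconstructed_T, "s")
```

The head start and the slowed advance add up to the reading T, so the observer in B accounts for T being greater than T' while still maintaining that every clock in frame A runs slow.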


As you can see from this and the preceding Section, there is a complete and remark- 
able symmetry between all observers who use inertial frames. They would all agree 
that moving clocks run slow, and they would all agree that moving rods contract. 
There is no phenomenon that will permit an observer to conclude that he is the one 
who is ‘really’ moving. That is not really a surprising result because such a 
phenomenon would conflict with the principle of relativity (Section 5.1), and that 
principle was one of the postulates that led Einstein to the Lorentz transformation. 


SAQs 20 and 21 lead to a well-known problem, the so-called twin paradox. If you
want to learn about it, make sure that you read the answer and comment to SAQ 21.


SAQ 20 The distance to the triple star system Alpha Centauri is such that 
it takes light 4.2 years to travel from that system to the Earth. If an astronaut 
left the Earth and travelled at a constant speed of 0.9c relative to the Earth 
towards Alpha Centauri, how long would it take, according to the astronaut, 
for him to reach there? 


SAQ 21 Suppose that the astronaut of SAQ 20 turns his spacecraft around 
as soon as he arrives at Alpha Centauri and returns to Earth, travelling at 0.9c 
relative to Earth throughout his voyage. How long would the round trip have 
taken: 


(a) according to the astronaut? 


(b) according to the astronaut’s twin brother, who stayed on Earth through- 
out the voyage? 


5.8 Experimental support for special relativity


Time dilation is one of the few relativistic effects that can be demonstrated in a
fairly simple experiment. TV12 includes film of such an experiment. It shows that
a certain kind of elementary particle, known as a muon, which has a half life of
$1.5 \times 10^{-6}$ s, experiences time dilation as it travels through the Earth's
atmosphere at very high speed. The programme also includes a discussion of the
two postulates of special relativity.
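
To indicate the size of the effect, the fraction of muons that survive a journey can be estimated with and without time dilation. The sketch below is not a description of the experiment shown in TV12. It assumes the usual half-life rule, in which the surviving fraction after a time t is $(1/2)^{t/t_{1/2}}$, and the path length and speed are invented values chosen for illustration.

```python
import math

C = 299_792_458.0    # speed of light in m/s
HALF_LIFE = 1.5e-6   # muon half life in its own rest frame, in seconds


def surviving_fraction(elapsed_time, half_life):
    """Fraction of particles remaining after elapsed_time."""
    return 0.5 ** (elapsed_time / half_life)


# Hypothetical journey, as measured in the frame of the Earth.
distance = 2000.0    # metres of atmosphere traversed
speed = 0.99 * C     # speed of the muons relative to the Earth

# Duration of the journey according to clocks fixed on the Earth.
earth_time = distance / speed

# Duration according to a clock travelling with the muons (equation 34).
muon_time = earth_time * math.sqrt(1.0 - (speed / C) ** 2)

without_dilation = surviving_fraction(earth_time, HALF_LIFE)
with_dilation = surviving_fraction(muon_time, HALF_LIFE)

print("surviving fraction ignoring time dilation:", round(without_dilation, 3))
print("surviving fraction allowing for time dilation:", round(with_dilation, 3))
```

Under these assumptions far more muons survive the journey than a calculation that ignores time dilation would predict.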


Study comment The Notes for TV12 are contained in a separate booklet, 
but you should regard them as part of this Section. After watching the TV
programme and reading the Notes you should be able to answer the following 
SAQs: 


SAQ 22 The half life of the neutron is approximately 648 s. Suppose that at 
some particular instant 10° neutrons could be created in a laboratory. If all 
of the neutrons were at rest in the laboratory, roughly how many of them 
would remain after 84 minutes? 




SAQ 23 The Sun is a source of high-speed neutrons, some of which reach the
Earth. Suppose that 10° neutrons, each with a speed of 0.95c, leave the Sun 
and travel towards the Earth. If, according to an observer on the Earth, each 
neutron requires 84 minutes to travel from the Sun to the Earth, roughly how 
many of the neutrons will reach the Earth? 


5.9 Summary of Section 5


Einstein was able to use two postulates—the principle of relativity and the principle 
of the constancy of the speed of light—to deduce a solution to the central problem of 
relativity. Einstein’s solution was the Lorentz transformation (see fold-out page). 


The use of the Lorentz transformation leads to the following results: 


(a) For small values of V/c, the Lorentz transformation gives approximately the 
same results as the Galilean coordinate transformation. 


(b) Simultaneity is relative. Two events that occur simultaneously in one frame of 
reference are not necessarily simultaneous in all other frames of reference. 


(c) The order of events is relative. The order in which two events occur depends on 
the frame in which they are observed, provided one event could not have caused the 
other. 


(d) The velocity transformation of special relativity (equations 28, 29 and 30) is 
such that the speed of light is the same in all inertial frames. 


(e) Length is a relative quantity. The length of a rod depends upon the frame in
which it is measured. (‘Moving rods contract’.)


(f) The duration of an interval of time is a relative quantity. The rate at which a clock
ticks depends upon the frame in which it is measured. (‘Moving clocks run slow’.)


Relativistic physics 


6.1 The scope of relativistic physics


Once Einstein had formulated the principle of relativity, and had shown that the 
Lorentz transformation was the coordinate transformation that linked inertial 
frames, it became possible to test all of the so-called ‘laws’ of physics, to see whether 
or not they took the same form in all inertial frames. Any so-called ‘law’ that did not 
retain its form when transformed by means of the Lorentz transformation could not 
really be a law of physics at all, no matter what it was called. 


Using this simple test, Einstein found that Maxwell’s laws of electrodynamics were 
basically sound. After reinterpreting some of Maxwell’s ideas and disposing of the 
ether (Section 2.3), Einstein found that the Maxwell equations did preserve their 
form under Lorentz transformation. The Maxwell equations did take the same form 
in all inertial frames, so they were possible laws of physics. However, other parts of 
physics did not fare so well. 


Mechanics, in particular, had to be modified to a considerable extent to ensure that 
its laws took the same form in all inertial frames. Newton’s second law, in the form 
F = ma, is not a law at all in the relativistic sense, nor is Newton's law of gravitation. 
Section 6.2 describes the changes required to bring mechanics into line with the 
demands of the principle of relativity. 


In some parts of physics the modifications demanded by relativity were easy to 
implement. For example, Einstein and Planck were able to write down the basic laws 
of relativistic thermodynamics within two years of the original presentation of special 
relativity. In other parts of physics, such as quantum field theory (Unit 15), the correct 
relativistic laws are still a matter of debate. Of course, in some fields, especially those 
that do not involve high speeds or great precision, equations that do not take the 
same form in all inertial frames are still used as ‘laws’ even though it is accepted that
they cannot be exactly true. However, there are fields, such as the study of elementary
particles or the theory of gravitation, in which laws that do not satisfy the principle
of relativity are rarely taken seriously.




6.2 Relativistic mechanics


Study comment You will probably find it helpful to have Unit 3 with you 
while reading this Section. 


In searching for the correct laws of relativistic mechanics, it makes more sense to try 
to modify the highly successful Newtonian laws rather than to scrap them altogether 
and start from scratch. So in what follows we will repeat, as far as possible, the 
development of Newtonian mechanics given in Unit 3. However, our starting point 
will be a new definition of momentum, and, because we start from a new definition of 
momentum, we will end up with a new concept of energy. 


6.2.1 Relativistic momentum


In relativistic mechanics the relativistic momentum of a particle of mass m (where
m ≠ 0) and velocity $\mathbf{v}$ is defined by the following vector equation:

$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - \dfrac{v^2}{c^2}}} \qquad (35)$$


The mass m that appears in this formula is just the usual mass of the particle—the 
Newtonian mass that would be determined in a low-speed experiment. However, 
just as we had to take great care over the definition of the length of a moving rod in 
Section 5.6, so we should be equally careful about defining the mass of a moving 
particle. To be really precise we call m the rest mass of the particle and we define it in 
the following way: 


The rest mass m is the Newtonian mass as measured by an observer relative to 
whom the particle is at rest. 


Because of the way it is defined, the rest mass is totally independent of the speed of the 
particle. Although the particle may be moving with a very high speed relative to me, 
the rest mass of the particle in my frame is still the mass that would be found by some 
other observer relative to whom the particle was at rest. The rest mass is not a relative 
quantity—it has the same value for all observers and is independent of the speed of 
the particle. 


You may have heard it said that mass depends on speed in special relativity. This 
statement arises from the fact that some authors prefer to write equation 35 in the 
form: 


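$$\mathbf{p} = M_{\mathrm{rel}}\,\mathbf{v}, \quad \text{where} \quad M_{\mathrm{rel}} = \frac{m}{\sqrt{1 - \dfrac{v^2}{c^2}}}$$

The quantity $M_{\mathrm{rel}}$ is called the relativistic mass and, as you can see, it depends on the
speed v of the particle. $M_{\mathrm{rel}}$ gets bigger as v increases. However, we will not make any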
further use of the concept of relativistic mass in this Unit. Whenever we refer to a 
mass we will always mean a rest mass and the mass will therefore always be independent 
of speed. 


Why should the quantity defined in equation 35 be called relativistic momentum? 
What relation does it have to the Newtonian concept of momentum and what’s 
wrong with the Newtonian concept of momentum? The following two points will 
answer these questions. 


(i) Relativistic momentum is conserved 


In all experiments involving colliding particles, it is found that relativistic momentum, 
as defined in equation 35, is a conserved quantity. For example, in the collision shown 
in Figure 35 the following relationship would be true: 


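(relativistic momentum before collision) = (relativistic momentum after collision)


$$\frac{m_1 v_1}{\sqrt{1 - v_1^2/c^2}} + \frac{m_2 v_2}{\sqrt{1 - v_2^2/c^2}} = \frac{m_1 w_1}{\sqrt{1 - w_1^2/c^2}} + \frac{m_2 w_2}{\sqrt{1 - w_2^2/c^2}} \qquad (36)$$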


[Figure 35: (a) before collision; (b) collision; (c) after collision]

When expressed in this way, the law of the conservation of (relativistic) momentum 
takes the same form in all inertial frames. 


(ii) At low speed p ≈ mv


You should already know, from Section 5.3, that when v/c is small the factor
$\sqrt{1 - v^2/c^2}$ is almost indistinguishable from 1. So at low speeds the definition of
relativistic momentum given in equation 35 takes the approximate form:


$$\mathbf{p} \approx m\mathbf{v} \qquad (37)$$


The conservation of relativistic momentum, combined with the similarity between 
relativistic momentum and Newtonian momentum at low speed, explains why 
centuries of low-speed experiments led to the conclusion that Newtonian momentum 
was conserved. This conclusion is now known to be quite wrong, though for many 
everyday problems it is an adequate approximation. 
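

The closeness of relativistic and Newtonian momentum at low speed can be checked
numerically. The following is only a minimal sketch: the function names and the rounded
value used for c are assumptions made for illustration, not part of the Unit. It evaluates
equation 35 and compares it with the Newtonian expression mv at several speeds.

```python
import math

C = 2.998e8  # speed of light in m/s (rounded value; an assumption of this sketch)


def relativistic_momentum(m, v, c=C):
    """Magnitude of the relativistic momentum, p = m v / sqrt(1 - v^2/c^2) (equation 35)."""
    if abs(v) >= c:
        raise ValueError("equation 35 only applies to speeds below c")
    return m * v / math.sqrt(1.0 - (v / c) ** 2)


def newtonian_momentum(m, v):
    """Magnitude of the Newtonian momentum, p = m v."""
    return m * v


if __name__ == "__main__":
    m = 1.0  # rest mass in kg (hypothetical value)
    for fraction in (0.001, 0.01, 0.1, 0.5, 0.9, 0.99):
        v = fraction * C
        ratio = relativistic_momentum(m, v) / newtonian_momentum(m, v)
        print(f"v = {fraction:5.3f} c   p_relativistic / p_Newtonian = {ratio:.6f}")
```

The ratio printed is just the factor $1/\sqrt{1 - v^2/c^2}$, so it stays very close to 1 until v
becomes an appreciable fraction of c.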


Before leaving the subject of relativistic momentum there is one other point that 
should be mentioned. Some particles, such as photons—the ‘particles’ of light— 
always travel at speed c. You might think that equation 35 implies that such particles 
have infinite momentum, but that is not the case. All particles that travel at the speed 
of light have zero rest mass (m = 0), so they are not subject to equation 35. The 
relativistic momentum of a zero rest mass particle, such as a photon, has to be worked
out from a different formula, which will be discussed in Section 6.2.3. 


6.2.2 Relativistic force


The definition of relativistic momentum given in equation 35 can be used to help 
define the concept of relativistic force. As usual, things must be arranged so that when 
v/c is small the usual Newtonian results are recovered, since they are known to be 
accurate at low speed. Bearing this in mind, a reasonable definition of force is: 


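$$\mathbf{F} = \frac{d\mathbf{p}}{dt} = \frac{d}{dt}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) \qquad (38)$$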


This says that the force acting on a particle is equal to the rate of change of the 
(relativistic) momentum of the particle, an idea you may recall from Unit 3. Equation 
38 is the relativistic analogue of Newton’s second law. 


It follows from equation 38 that if a constant force acts on a particle, the (relativistic) 
momentum of the particle must increase at a constant rate. In Newtonian mechanics 
this would mean that the speed of the particle also increased at a constant rate, since 
speed and momentum are directly proportional in Newtonian mechanics. However, 
equation 35 implies that speed and momentum are not directly proportional in 
relativistic mechanics, so an increase in relativistic momentum does not cause a 
proportional increase in speed. 


Figure 35 (a) Two particles, of mass $m_1$ and $m_2$, move with constant speeds in
the same direction. The speeds of the particles are $v_1$ and $v_2$ respectively, and
$v_1$ is greater than $v_2$. (b) Eventually, the two particles will collide. It may be
assumed that no energy is lost in this particular collision. (c) After the collision the
particles continue to move in the original direction, but their speeds have changed. The
particle of mass $m_1$ now has speed $w_1$ and the particle of mass $m_2$ has speed $w_2$.


Figure 36 shows the way in which the speed of a particle and the magnitude of its
momentum are related in relativistic mechanics. As you can see, increasing the
momentum of the particle by a given amount has a diminishing influence on the
speed of the particle as the speed increases. For example, by using equation 35 you
should be able to convince yourself that increasing the magnitude of the momentum
from $mc$ to $2mc$ increases the speed by about 26% (from $c/\sqrt{2}$ to $2c/\sqrt{5}$), while increasing
the momentum from $2mc$ to $3mc$ only increases the speed by 6% (from $2c/\sqrt{5}$ to
$3c/\sqrt{10}$).


Applying a constant force to a particle for a very long time would certainly give it a 
large relativistic momentum, but that does not mean that the particle would ever 
attain the speed of light. As you can see from Figure 36, the speed of light is a limit 
that no particle of non-zero rest mass can ever quite reach, however large its
momentum may be.
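

The way the speed levels off as the momentum grows can be explored with a short
calculation. Solving equation 35 for the speed gives $v/c = (p/mc)/\sqrt{1 + (p/mc)^2}$. The
sketch below (the function name is hypothetical, chosen only for illustration) tabulates
this for momenta that are whole multiples of mc.

```python
import math


def speed_from_momentum(p_over_mc):
    """Return v/c for a particle whose momentum magnitude is p = (p_over_mc) * m * c.

    Obtained by solving equation 35 for v:
        v/c = (p/mc) / sqrt(1 + (p/mc)^2)
    """
    return p_over_mc / math.sqrt(1.0 + p_over_mc ** 2)


if __name__ == "__main__":
    previous = None
    for k in (1, 2, 3, 10, 100):
        beta = speed_from_momentum(k)
        if previous is None:
            print(f"p = {k:>3} mc   v = {beta:.4f} c")
        else:
            gain = 100.0 * (beta / previous - 1.0)
            print(f"p = {k:>3} mc   v = {beta:.4f} c   (+{gain:.1f}% on the previous row)")
        previous = beta
```

However large the multiple of mc, the value returned stays below 1, which is the
numerical counterpart of the statement that the speed of light is never quite reached.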


SAQ 24 A particle of mass 5 kg has an initial speed of 2 x 10° ms”'. The 
particle is acted upon by a constant relativistic force of magnitude 10° N 
acting in the same direction as the initial relativistic momentum. If the force 
acts for 10? s, work out the following quantities: 


(a) The initial magnitude of the relativistic momentum. 
(b) The final magnitude of the relativistic momentum. 
(c) The final speed of the particle. 


6.2.3 Relativistic energy


As you may recall from Unit 3, the concept of force is linked with the concept of 
energy by two less fundamental concepts: work and power. The relativistic definitions 
of work and power are the same as those used in Newtonian mechanics. So as we 
work towards a definition of relativistic energy we will appear to be following the 
same pathway as Newtonian mechanics. However, you must not forget that our 
point of departure—the definition of momentum — was different from that of Newton. 
This means that the point at which we end our journey—the definition of energy— 
will also be different. 


Consider a constant force F (relativistic, naturally) acting on a particle of mass m.
If the force causes the particle to move through a small displacement $\Delta\mathbf{s}$, the amount
of work that has been done is given by the scalar product of F and $\Delta\mathbf{s}$. If we call this
small amount of work $\Delta W$ we can write


$$\Delta W = \mathbf{F}\cdot\Delta\mathbf{s}$$


Now, if you think back to Unit 3 you should recall that the power of a force is the
rate at which that force does work. So, if the particle we were just considering required
a time $\Delta t$ to move through the small displacement $\Delta\mathbf{s}$ we could say that the power P
of the force F was


$$P = \frac{\Delta W}{\Delta t} = \mathbf{F}\cdot\frac{\Delta\mathbf{s}}{\Delta t}$$


But, of course, $\Delta\mathbf{s}/\Delta t$ is just the velocity v of the particle, so we can write


$$P = \mathbf{F}\cdot\mathbf{v}$$


Another fact that you should recall from Unit 3 is that the power of a force is the rate
at which that force transfers energy to the particle. Adopting this relationship as part
of relativistic mechanics, we can write down the following equation for the rate at
which the relativistic energy E of the particle changes with time:


$$\frac{dE}{dt} = P = \mathbf{F}\cdot\mathbf{v} \qquad (39)$$


This looks exactly like a formula from Newtonian mechanics, but if equation 38 is 
used to eliminate F from equation 39, the result is distinctly non-Newtonian: 


$$\frac{dE}{dt} = \mathbf{v}\cdot\frac{d}{dt}\left(\frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}\right) \qquad (40)$$


Figure 36 The magnitude p of the relativistic momentum plotted against the speed v.
Near v = c, even large increases in the relativistic momentum only have a small effect
on the speed. (The Newtonian momentum magnitude, p = mv, is also included for
comparison.)


Using calculus it is possible to solve equation 40 and find an explicit formula for the
relativistic energy E. The arguments required to obtain the formula are above the
level of this Course, so we shall simply quote the result without proof.* However, the
result itself should not surprise you; it only involves ingredients that are already
present in equation 40.


$$E = \frac{mc^2}{\sqrt{1 - v^2/c^2}} \qquad (41)$$


This is a very important formula: it provides an expression for the relativistic energy 
E of a particle of rest mass m, according to an observer in an inertial frame relative to
which the particle has speed v. At first sight it may seem that relativistic energy has
little or nothing to do with the concept of energy used in Newtonian mechanics.
However, that is not the case. It is quite simple to find a link between the relativistic
energy defined in equation 41 and the low-speed Newtonian concept of kinetic
energy ($E_{\text{kin}} = \tfrac{1}{2}mv^2$).


The key to this link is in the factor $1/\sqrt{1 - v^2/c^2}$. With the aid of a calculator you
can easily show for yourself that if v/c is sufficiently small, less than 0.2 say, then the
following approximation is fairly accurate:


$$\frac{1}{\sqrt{1 - v^2/c^2}} \approx 1 + \frac{1}{2}\frac{v^2}{c^2} \qquad (42)$$

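Using this approximation in equation 41 gives the following approximate expression
for the relativistic energy of a low-speed particle (v = 0.2c, or less, say):


$$E \approx mc^2\left(1 + \frac{1}{2}\frac{v^2}{c^2}\right) \qquad (43)$$

i.e. $$E \approx mc^2 + \tfrac{1}{2}mv^2$$

Thus, $$E - mc^2 \approx \tfrac{1}{2}mv^2, \quad \text{for small } \frac{v}{c} \qquad (44)$$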


So if we subtract the constant $mc^2$ from the relativistic energy, what remains is
approximately equal to the Newtonian kinetic energy, provided the particle is not 
moving too fast. 
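

The calculator check suggested above can also be written as a few lines of code. This is
a sketch only, with hypothetical function names: it compares the exact factor
$1/\sqrt{1 - v^2/c^2}$ with the approximation of equation 42 and prints the relative error for
several values of v/c.

```python
import math


def exact_factor(beta):
    """The factor 1 / sqrt(1 - v^2/c^2), where beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta ** 2)


def low_speed_factor(beta):
    """The low-speed approximation 1 + (1/2) v^2/c^2 (equation 42)."""
    return 1.0 + 0.5 * beta ** 2


if __name__ == "__main__":
    for beta in (0.01, 0.1, 0.2, 0.5, 0.9):
        exact = exact_factor(beta)
        approx = low_speed_factor(beta)
        relative_error = (exact - approx) / exact
        print(f"v/c = {beta:4.2f}   exact = {exact:.6f}   "
              f"approx = {approx:.6f}   relative error = {relative_error:.2e}")
```

Multiplying either factor by $mc^2$ gives the corresponding energy, so the same comparison
applies to equations 41 and 43.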


This result suggests yet another definition. If $E - mc^2$ is approximately equal to the
Newtonian kinetic energy when v is small, why not use $E - mc^2$ as the definition of
relativistic kinetic energy? By doing so we will guarantee that for low speeds the
relativistic kinetic energy is approximately equal to the Newtonian kinetic energy. So,
for a particle of mass m and relativistic energy E, the relativistic kinetic energy is
defined by


relativistic kinetic energy = $E - mc^2$ (45)


By rewriting equation 45 in the form


E = relativistic kinetic energy + $mc^2$


you can see that the relativistic energy E consists of two parts. The first part is the
relativistic kinetic energy, which depends on the speed of the particle. The second
part is given by $mc^2$ and it does not depend on the speed of the particle at all. But what
is $mc^2$, and what, if anything, is its physical significance?


It is clear on dimensional grounds that $mc^2$ must be some form of energy. Each of the
other terms in equation 45 is a form of energy, so $mc^2$ must also be a form of energy.
Because $mc^2$ depends on the rest mass of the particle it is called the rest-mass energy.
Physically, the rest-mass energy of a particle is the energy ‘tied up’ in the particle’s
mass. More will be said about this distinctly non-Newtonian equivalence between
mass and energy in the next subsection. For the present we will simply make do with
the definition


rest-mass energy = $mc^2$ (46)


* If you are unhappy about simply accepting such a result and would like to see the arguments
that link equations 40 and 41, refer to the book by R. Katz, An Introduction to the Special
Theory of Relativity, published by Van Nostrand (1964). However, in order to understand the
argument you will need to know the principles of integral calculus.


Putting equations 45 and 46 together we obtain the following equation 


relativistic energy = relativistic kinetic energy + rest-mass energy 


ITQ 8 Using equations 35 and 41, verify the following relationship between 
E (the relativistic energy) and p (the magnitude of the relativistic momentum): 


$$E^2 = p^2c^2 + m^2c^4 \qquad (47)$$


Equation 47 is more than just an interesting relation between relativistic energy and 
relativistic momentum. It is an important result in its own right, because it enables us 
to extend the ideas of relativistic mechanics to cover particles of zero rest mass, such 
as photons (‘particles’ of light). It has already been stated that such particles always 
travel at speed c, and you can now see from equation 47 that the relativistic energy of 
such particles is simply related to the magnitude of their relativistic momentum by 
the formula 


E=cp (provided m = 0) (48) 


Experiments involving collisions between massless particles and massive particles 
show that equation 48 is indeed valid. You will meet this idea again in Unit 14, when 
such collisions are discussed in the context of quantum mechanics. 
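

Equation 47 and its massless limit, equation 48, lend themselves to a quick numerical
check. The sketch below rests on a few assumptions: the function names are hypothetical,
c is rounded, and the proton mass is the value quoted in SAQ 25. It computes E and p from
the speed using equations 41 and 35, then confirms that the same E is recovered from p
through equation 47.

```python
import math

C = 2.998e8  # speed of light in m/s (rounded value; an assumption of this sketch)


def energy_from_speed(m, v, c=C):
    """Relativistic energy E = m c^2 / sqrt(1 - v^2/c^2) (equation 41)."""
    return m * c ** 2 / math.sqrt(1.0 - (v / c) ** 2)


def momentum_from_speed(m, v, c=C):
    """Relativistic momentum magnitude p = m v / sqrt(1 - v^2/c^2) (equation 35)."""
    return m * v / math.sqrt(1.0 - (v / c) ** 2)


def energy_from_momentum(m, p, c=C):
    """Relativistic energy from E^2 = p^2 c^2 + m^2 c^4 (equation 47); also valid for m = 0."""
    return math.sqrt((p * c) ** 2 + (m * c ** 2) ** 2)


if __name__ == "__main__":
    m_p = 1.67e-27  # proton rest mass in kg, as quoted in SAQ 25
    v = 0.8 * C
    p = momentum_from_speed(m_p, v)
    print("E from equation 41:", energy_from_speed(m_p, v), "J")
    print("E from equation 47:", energy_from_momentum(m_p, p), "J")
    # For a massless particle equation 47 reduces to E = c p (equation 48).
    print("massless particle with the same p:", energy_from_momentum(0.0, p), "J")
```

Writing the energy as a function of p rather than v is what allows the m = 0 case to be
handled by the same function, which mirrors the role equation 47 plays in the text.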


SAQ 25 (a) What is the rest-mass energy of a proton?
($m_p = 1.67 \times 10^{-27}$ kg)


(b) At what speed must a proton travel if its relativistic kinetic energy is equal 
to its rest-mass energy? 


(c) What is the relativistic energy of a proton that has a relativistic kinetic 
energy three times as great as its rest-mass energy? 


c 
(d) What is the relativistic energy of a proton with speed v — 3 ? 


SAQ 26 Consider a particle with relativistic momentum p given by


$$p_x = 3\ \text{kg m s}^{-1}, \qquad p_y = 4\ \text{kg m s}^{-1}, \qquad p_z = 0.$$


(a) If the particle has mass $m = (5/3) \times 10^{-8}$ kg, what is its relativistic energy?


(b) If the particle has mass m = 0, what is its relativistic energy?


6.2.4 Relativistic energy conservation


You will probably not be surprised to learn that relativistic energy is a conserved 
quantity. However, it is important to realize that the conservation of relativistic 
energy has dramatic consequences that lie well outside the realm of Newtonian 
mechanics. In particular, conservation of relativistic energy means that it is possible 
for processes to take place in which mass is converted into kinetic energy, or even 
kinetic energy converted into mass. A few examples should convince you of the 
interconvertibility of mass and kinetic energy. 


Figure 37 The decay of a $\pi^0$ meson into a pair of photons: (a) before decay; (b) after
decay. The diagram shows the decay as it would be seen by an observer relative to whom
the $\pi^0$ meson was at rest. According to such an observer, the photons travel in opposite
directions away from the original location of the $\pi^0$ meson.


Figure 37 is a representation of a process known as $\pi^0$ (pronounced ‘pi-zero’) meson
decay, a process that has been observed experimentally a great many times. In this
process a certain kind of elementary particle, called a $\pi^0$ meson, spontaneously decays
into electromagnetic radiation. The radiation consists of two photons (two massless
particles of light) that travel away from the original location of the $\pi^0$ meson. If the
decay is observed from the frame in which the $\pi^0$ meson is at rest, then the two photons
will travel in opposite directions, as shown in Figure 37b. Before the decay, all the
relativistic energy is in the form of rest-mass energy. After the decay (Figure 37b)
there is no rest-mass energy at all, but there is relativistic kinetic energy. Each of the
photons has a relativistic kinetic energy equal to half the rest-mass energy of the $\pi^0$
meson. So, the total relativistic energy of the two photons is exactly equal to the
rest-mass energy of the original $\pi^0$ meson. Thus relativistic energy is conserved, but
rest mass is not.


[Figure 38: (a) before collision, a proton of mass $m_p$ moving with speed v approaches a
proton of mass $m_p$ at rest; (b) after collision]

Figure 38 shows another process involving $\pi^0$ mesons in which relativistic energy is
conserved, though again, rest mass is not. The collision shown in Figure 38 is a
simplified and somewhat idealized example of the kind of process in which $\pi^0$ mesons
are created. A proton of mass $m_p$, travelling at a high speed v, collides with another
proton that happens to be at rest in the laboratory. As a result of the collision the
protons move off with speeds $w_1$ and $w_2$, and a $\pi^0$ meson of mass $m_\pi$ and speed $w_3$
is created. The law of relativistic energy conservation demands that


$$\frac{m_p c^2}{\sqrt{1 - v^2/c^2}} + m_p c^2 = \frac{m_p c^2}{\sqrt{1 - w_1^2/c^2}} + \frac{m_p c^2}{\sqrt{1 - w_2^2/c^2}} + \frac{m_\pi c^2}{\sqrt{1 - w_3^2/c^2}}$$


i.e. initial relativistic energy = final relativistic energy.


What has happened is that the rest-mass energy of the $\pi^0$ meson has been created at
the expense of the kinetic energy of the protons. Although the form of the energy has 
altered, the total relativistic energy before and after the collision is unchanged. 


The equivalence of mass and energy is a basic principle, and is not restricted to exotic 
entities such as $\pi^0$ mesons. For example, suppose that a block of material is heated
so that its internal energy increases; the increase in internal energy would produce a 
corresponding increase in rest mass. Or consider an extended spring, which will 
have an infinitesimally greater rest mass than the same spring in its unextended 
state. 


Perhaps the most dramatic example of mass-energy equivalence is provided by the 
detonation of an atomic or hydrogen bomb. Such devices work by converting 
rest-mass energy into kinetic energy (and, ultimately, heat). The energy released when 
a mass m, at rest, is destroyed is given by $E = mc^2$. This can be very large, even for a
small value of m, due to the enormous size of c?. Total conversion of 1 kg yields the 
same amount of energy as the detonation of 20 megatonnes of TNT. The destruction 
of Hiroshima and Nagasaki involved the annihilation of just a few grams of matter. 
This stark fact shows that special relativity is not just the concern of those who 
wish to create $\pi^0$ mesons; it may well control the destiny of us all.
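

The comparison with TNT can be reproduced with a two-line calculation. The sketch below
assumes the conventional energy equivalent of $4.184 \times 10^{15}$ J per megatonne of TNT and
a rounded value of c; the function names are hypothetical and neither figure is taken from
the Unit itself.

```python
C = 2.998e8                       # speed of light in m/s (rounded value; an assumption)
JOULES_PER_MEGATONNE = 4.184e15   # conventional TNT equivalent (an assumption of this sketch)


def rest_mass_energy(m, c=C):
    """Rest-mass energy E = m c^2 (equation 46), in joules for m in kilograms."""
    return m * c ** 2


def megatonnes_of_tnt(energy_joules):
    """Express an energy in megatonnes of TNT, using the conventional equivalent."""
    return energy_joules / JOULES_PER_MEGATONNE


if __name__ == "__main__":
    for mass_kg in (1.0, 1.0e-3):
        energy = rest_mass_energy(mass_kg)
        print(f"m = {mass_kg:g} kg   E = {energy:.3e} J   "
              f"= {megatonnes_of_tnt(energy):.3g} megatonnes of TNT")
```

With these assumed constants the result for 1 kg comes out a little above 20 megatonnes,
in line with the round figure quoted in the text.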


SAQ 27 It was stated that the two photons shown in Figure 37b travel in
opposite directions, in the rest frame of the $\pi^0$ meson, each carrying an energy
equal to half the rest-mass energy of the $\pi^0$ meson. Why must this be so? Why
can't one photon carry more or less energy than the other, provided their
total energy still adds up to the rest-mass energy of the $\pi^0$ meson?


Figure 38 The creation of a $\pi^0$ meson in an energy-conserving collision of two
protons. (Relativistic momentum is also conserved in such a collision.)


SAQ 28 A $\pi^0$ meson can decay in a number of different ways. For example,
one possible mode is that in which the $\pi^0$ meson decays into an electron and a
positron. (A positron has the same mass as the electron, but carries the opposite
electric charge.) What is the speed of the electron emitted by a decaying $\pi^0$
meson, as measured in the frame in which the $\pi^0$ meson is at rest? (The rest
mass of the $\pi^0$ meson is $2.40 \times 10^{-28}$ kg.)


6.3 Summary of Section 6


According to the principle of relativity, the laws of physics must take the same form 
in all inertial frames. 


The laws of Newtonian mechanics fail to do this, so they must be modified. 


Relativistic mechanics incorporates all of the successes of Newtonian mechanics and 
much else besides. 


The relativistic momentum of a particle of mass m and velocity v is defined by


$$\mathbf{p} = \frac{m\mathbf{v}}{\sqrt{1 - v^2/c^2}}$$


Both relativistic momentum and relativistic energy are conserved quantities. 


The total destruction of a mass m, while at rest, would liberate the rest-mass energy 
$E = mc^2$.


Conclusion 


Einstein’s special theory of relativity is one of the greatest achievements of modern 
physics. The theory holds enormous appeal for a wide audience; many people who
have never heard of physics at least know of the existence of Einstein’s theory. To 
physicists, the appeal of special relativity comes largely from its logic and internal 
consistency. The theory demands that we throw away our prejudices, and put our 
trust in a few simple experiments and our own powers of reasoning. Once we have 
accepted Einstein’s postulates, all else follows inevitably; we are carried along on an 
irresistible tidal wave of logic. If we don’t like the results, that’s just too bad; they are 
unavoidable once the postulates have been accepted. 


Quite apart from its intellectual appeal, relativity also has great practical value. The 
principle of relativity and the Lorentz transformation provide a kind of framework 
into which would-be physical theories must fit. This gives a way of ‘testing’ physical 
theories without recourse to experiment. For this reason, the theory of relativity is 
one of the most useful weapons in the arsenal of the theoretical physicist. 


Finally, for a wider audience, relativity has the appeal of the exotic. Its results are 
sufficiently strange to be intrinsically interesting, and yet not so bizarre as to be totally 
beyond belief. The student coming to grips with relativity for the first time expects 
to have his mental powers stretched by the demands of Einstein’s theory. In studying 
this Unit, you will, hopefully, have experienced that mental extension, and benefited 
from it. Perhaps you have even enjoyed the experience. 


‘Nothing puzzles me more than time and space; and yet nothing troubles me less, as
I never think about them.’ (Charles Lamb)




Objectives 
After studying Unit 12, you should be able to: 


1 Outline two experiments which demonstrate that the speed of light is independent 
of its source (SAQ 1; TV 12) 


2 Outline the Michelson-Morley experiment and explain how it gives some support 
to the idea that the speed of light is independent of the motion of the detector (SAQ 2) 


3 Define the following terms: event, frame of reference, observer and inertial frame 
(SAQs 4, 5 and 6) 


4 Explain the terms coordinate transformation and standard configuration, and 
both recognize and use the Galilean coordinate transformation (SAQs 9 and 10; ITQs
3 and 4)


5 State Einstein’s two postulates, and both recognize and use the Lorentz trans- 
formation (SAQs 7, 8 and 12-15) 


6 Explain the difference between the nature of time in the Galilean coordinate 
transformation and the Lorentz transformation (SAQs 15 and 16; ITQ 5) 


7 Recognize the Galilean velocity transformation (equation 11), and the special 
relativistic rule for the transformation of the x-component of velocity (equation 28) 
(SAQs 11, 17 and 18; ITQs 6 and 7) 


8 Write down and use formulae describing the Lorentz contraction (equation 32) 
and the time dilation effect (equation 34) (SAQs 19-23; TV 12) 


9 Write down and use formulae for the relativistic momentum (equation 35) and
the relativistic energy (equation 41) of a particle of mass m and velocity v (SAQs
24-28; ITQ 8)


ITQ answers and comments 


ITQ 1 (a) Equation 3 is a consequence of Coulomb’s law
(Unit 6) and Newton’s second law (Unit 3). According to Coulomb’s
law, the force F acting on a charge q when it is at a displacement r
from a fixed charge Q is:


$$\mathbf{F} = \frac{qQ}{4\pi\varepsilon_0 r^2}\left(\frac{\mathbf{r}}{r}\right)$$


(The unit vector (r/r) shows that the force is in the same direction as
the displacement vector r of q from Q.)


But from Newton’s second law F = ma, so


$$\mathbf{a} = \frac{qQ}{4\pi\varepsilon_0 m r^2}\left(\frac{\mathbf{r}}{r}\right)$$


Remembering that (r/r) is simply a vector of unit magnitude, and in
this case it points along the x-axis, it follows that


$$a_x = \frac{qQ}{4\pi\varepsilon_0 m r^2}$$


(b) As you can see by comparing this expression for $a_x$ with
equation 3, the value of the constant is:


$$\text{constant} = \frac{qQ}{4\pi\varepsilon_0 m}$$


ITQ 2 This is a difficult question to answer because it touches 
upon philosophy as well as physics. However, in principle the answer 
is no, because in order to be a law of physics, a statement must be 
true in all inertial frames, not just two. Of course, it is never possible 
in practice to test a law experimentally in every one of the infinitude 
of inertial frames, but don't worry about that at this stage. Section 4 
will explain how this problem can be overcome, so that experimental 
evidence obtained in your frame alone is sufficient to establish a 
physical law, provided that law can indeed be written in the same 
form in all inertial frames. 


ITQ 3 (a) (i) According to the observer in the adjoining room,
the point that you call x = −3 m, y = 2 m, z = 1 m, has
coordinates x′ = −9 m, y′ = 2 m, z′ = 1 m.

(ii) The origin of your frame is at x′ = −6 m, y′ = 0, z′ = 0.

(iii) The origin of the new frame is x′ = 0, y′ = 0, z′ = 0.


(b)-.0).-—x 29m, y —-7m,.z— 0: 
(ii), x —dm-yocdm,.z-—2m. 


ITQ 4 (a) The clock of the observer in the adjoining room is
always 1.2 s ahead of your clock. So an event that occurred at
t = 3.0 s in your frame would happen at t′ = 4.2 s in the frame of the
adjoining room.


(b) Similarly, your clock is always 1.2s behind the clock in the 
adjoining room. So an event that happens at t’ = 0.6 s according to 
the new observer, takes place at t = 0.6 — 1.2 = —0.6 s according 
to you. 


ITQ 5 (a) The origins are at x = 0 and x′ = 0. It follows from
the first part of equation 10 that if x′ = 0 and x = 0 then t = 0.


(b) It also follows from the last part of equation 10 that any event
that occurs at t = 0 in frame A also occurs at t′ = 0 in frame B.


(c) O and O′ coincide at t = t′ = 0. With t = 0 and $x = 10^9$ m,
equation 10 implies that $x' = 10^9$ m. Of course, y′ = y = 0 and
z′ = z = 0.


(d) An event that occurs at $x = 10^9$ m in frame A and $x' = 10^8$ m
in frame B must have taken place at time t, where x′ = x − Vt. Thus


$$10^8\ \text{m} = 10^9\ \text{m} - Vt = 10^9\ \text{m} - 10^8\ \text{m s}^{-1} \times t$$

i.e. $$t = \frac{10^9 - 10^8}{10^8}\ \text{s} = \left(\frac{10^9}{10^8} - \frac{10^8}{10^8}\right)\ \text{s} = 9\ \text{s}$$


(e) From the last part of equation 10 it follows that the event
discussed in part (d) happened at t′ = 9 s in frame B.


ITQ 6 No trick, the answer is simply 20 m.p.h. The amazing 
thing is that your everyday experience is not quite correct. 


ITQ 7 Suppose that in frame A a particle is observed to travel
from an event at $(x_1, y_1, z_1, t_1)$ to an event at $(x_2, y_2, z_2, t_2)$. If the
two events are infinitesimally close together the x- and y-components
of the instantaneous velocity of the particle, as measured in frame A,
are


$$v_x = \frac{x_2 - x_1}{t_2 - t_1}, \qquad v_y = \frac{y_2 - y_1}{t_2 - t_1}$$


If these events are observed from a frame B that is in standard
configuration with frame A, then the coordinates of the events
according to an observer in frame B are related to the coordinates
according to an observer in frame A by the Lorentz transformation.
In particular


$$y_1' = y_1, \qquad y_2' = y_2$$

$$z_1' = z_1, \qquad z_2' = z_2$$

$$t_1' = \frac{t_1 - (x_1 V/c^2)}{\sqrt{1 - (V^2/c^2)}}, \qquad t_2' = \frac{t_2 - (x_2 V/c^2)}{\sqrt{1 - (V^2/c^2)}}$$


The instantaneous velocity component $v_y'$ of the particle in frame B is


$$v_y' = \frac{y_2' - y_1'}{t_2' - t_1'}$$

i.e. $$v_y' = \frac{(y_2 - y_1)\sqrt{1 - (V^2/c^2)}}{t_2 - t_1 - (x_2 - x_1)(V/c^2)}$$


Dividing both the numerator and the denominator by $(t_2 - t_1)$, we
get


$$v_y' = \frac{v_y\sqrt{1 - (V^2/c^2)}}{1 - (v_x V/c^2)}$$

Similarly $$v_z' = \frac{v_z\sqrt{1 - (V^2/c^2)}}{1 - (v_x V/c^2)}$$


ITQ 8 Substituting the expressions


$$E = \frac{mc^2}{\sqrt{1 - (v^2/c^2)}} \quad \text{and} \quad p = \frac{mv}{\sqrt{1 - (v^2/c^2)}}$$


into equation 47 gives


$$\frac{m^2c^4}{1 - (v^2/c^2)} = \frac{m^2v^2c^2}{1 - (v^2/c^2)} + m^2c^4$$

i.e. $$\frac{m^2c^4}{1 - (v^2/c^2)} = \frac{m^2v^2c^2 + m^2c^4(1 - v^2/c^2)}{1 - (v^2/c^2)} = \frac{m^2c^4}{1 - (v^2/c^2)}$$


Thus the left and right sides of the equation are identical, which
verifies the validity of equation 47.


SAQ answers and comments


SAQ 1 Both beams will strike the planet at the same time because
the speed of light is independent of the speed of its source. The
speeds given in the question are irrelevant; all that matters is that
the lasers are fired simultaneously, while they are both at the same
distance from the planet.


SAQ 2 Your neighbour’s explanation of the null result might
seem rather improbable, but it is none the less possible. The
explanation can be disproved in a number of ways. For example, if you
repeated your experiment a few hours after your neighbour’s visit,
the rotation of the Earth would probably have reoriented your
equipment in such a way that the direction of motion through the
ether was no longer perpendicular to the plane of your table top.
If your neighbour was still unconvinced, you could wait a few
months so that the Earth’s orbital motion around the Sun had also
changed the orientation and speed of your table top with respect to
the ether. If a ‘table top’ Michelson-Morley experiment were
possible, you could probably rotate it into a vertical rather than a
horizontal plane and that would give you a way of disproving your
neighbour’s claim immediately. (The original Michelson-Morley
experiment could only be rotated in the horizontal plane, since it
was built on a heavy base which provided the kind of rigid support
needed for a high precision interference experiment.)


SAQ 3 In a vacuum, the speed of light relative to a detector is
always c. So the speed of the light emitted by the laser is not only c
relative to the detector orbiting the Earth, but it is also c relative to
the detector on board the spacecraft. The fact that the two detectors
are moving relative to one another is irrelevant, as is the fact that
the detector on board the spaceship can be regarded as measuring
the speed of the light relative to its source. All that matters is the
speed relative to the detector.


SAQ 4 The light signal will take 100 seconds to travel from A to B.
When B sees the signal he knows that the time according to A is
t = 100 s. As B takes one second to adjust his clock, he should
start it at 101 seconds.


SAQ 5 It takes light (16 − 10)/2 = 3 seconds to reach C from D.
It also takes a light signal 5 seconds to travel from D to E. Thus, the
total distance between C and E is $8 \times 3 \times 10^8$ m $= 24 \times 10^8$ m.


SAQ 6 By Pythagoras' theorem, the distance between the origin
and the point at which the explosion occurred is $\sqrt{x^2 + y^2} =
\sqrt{9 + 16} \times 10^8$ m $= 5 \times 10^8$ m. So light takes 1.67 s to travel from
the explosion to the origin. If light from an explosion reaches the
origin at t = 12 s, an observer would say that the explosion occurred
at t = (12 − 1.67) s = 10.33 s.


SAQ 7 The observer in the other frame should find that an 
expression of the same form is valid in his frame 
dex dx 


ie: — = 
dit? dr 


HKI = XY + 6 - YY ee - zy) 
where k’ is constant. 


SAQ 8 No, your friend's claim is not justified. He can certainly 
distinguish his frame from yours by observing the motion of the 
bridge in his frame, but that does not discredit the principle of 
relativity. The point is that the principle of relativity does not claim 
that all inertial frames are totally indistinguishable. (Indeed, it is 
obvious that frames which have their origins in two different 
adjoining rooms could be distinguished by something as simple as 
the colour of the curtains nearest to the origin of each frame.) The 


principle of relativity ‘merely’ claims that the laws of physics take 
the same form in all inertial frames. So, there is no particular frame 
singled out by the laws of physics. The speed of the bridge in your 
friend’s frame is not controlled by a law of physics, it is merely a 
direct consequence of his viewpoint. It can be called an artefact of 
his frame rather than a result discovered in his frame. 


SAQ 9 From the coordinate transformation given in the question
you can see that


$$x = y' + 6\ \text{m}$$
$$y = -x'$$
$$z = z'$$
$$t = t' - 1.2\ \text{s}$$


This can be used to transform the various parts of equation 4:
the acceleration component $d^2x/dt^2$ in the original frame becomes
$d^2y'/dt'^2$ in the new frame. Neither the shift in the origin, nor the
replacement of x by y′ has any physical effect on the acceleration
component.


The term $(x - X)^2$ becomes $(y' + 6\ \text{m} - Y' - 6\ \text{m})^2$ which is just
$(y' - Y')^2$. The constant in equation 4 will remain the same, since
none of the quantities involved depends on the coordinate system.
Thus, the result that the new observer would deduce from the
experiment that led to equation 4 would be


$$\frac{d^2y'}{dt'^2} = \frac{\text{constant}}{(y' - Y')^2}$$


This has exactly the same form as equation 4.


SAQ 10 No. The arrangement described in the question does not 
include any guarantee that the two origins will coincide at t = t' = 0, 
and, in fact, they don't, as will become clear later. The definition of 
standard configuration means no more and no less than what it 
says. The whole point of having such a careful definition of standard 
configuration is that it allows us to discuss inertial frames in 
relative motion without having to make any assumptions or worry 
about any possible implications. 


SAQ 11 (a) In frame A the x-component of the missile's
velocity is


$$v_x = \sqrt{2}\,V\cos 45° = V \quad \left(\text{since } \cos 45° = \frac{1}{\sqrt{2}}\right)$$


so it follows from equation 11 that in frame B


$$v_x' = v_x - V = 0$$


(b) In frame A, $v_y = \sqrt{2}\,V\sin 45° = V$ (since $\sin 45° = 1/\sqrt{2}$),
so from equation 11 $v_y' = v_y = V$.


(c) In frame B the missile is travelling at 90° to the x′-axis.


SAQ 12 From equation 16 
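$$x' = \frac{18\times10^8 - [(3/5)\times 3\times10^8\times 10]}{\sqrt{1-(9/25)}}\,\text{m} = 0$$

$$y' = 1\,\text{m}$$

$$z' = 0$$

$$t' = \frac{10 - [(3/5)\times 3\times10^8\times 18\times10^8/(9\times10^{16})]}{\sqrt{1-(9/25)}}\,\text{s} = \frac{10-(18/5)}{\sqrt{16/25}}\,\text{s} = \frac{(50-18)}{5}\times\frac{5}{4}\,\text{s} = 8\,\text{s}$$

so the new coordinates are x' = 0, y' = 1 m, z' = 0, t' = 8 s. 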
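
The arithmetic in SAQ 12 can be checked numerically. The following is a minimal sketch, assuming the standard-configuration Lorentz transformation (equation 16) and the rounded value c = 3 × 10⁸ m s⁻¹ used in the worked answer; the function name `lorentz` and the constant `C` are illustrative, not part of the Unit.

```python
import math

C = 3.0e8  # rounded speed of light in m/s, as used in the worked answers


def lorentz(x, y, z, t, V, c=C):
    """Transform an event from frame A to frame B (standard configuration)."""
    gamma = 1.0 / math.sqrt(1.0 - (V / c) ** 2)
    x_prime = gamma * (x - V * t)
    t_prime = gamma * (t - x * V / c ** 2)
    # y and z are unchanged because the relative motion is along the x-axis
    return x_prime, y, z, t_prime


# Event of SAQ 12: x = 18e8 m, y = 1 m, z = 0, t = 10 s, with V = 3c/5
print(lorentz(18e8, 1.0, 0.0, 10.0, V=0.6 * C))
```

Because floating-point arithmetic is used, x' comes out as a number very close to zero rather than exactly zero.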


SAQ 13 (a) The x-component of the rocket’s velocity is 
$v_x = v\cos 45^\circ = \sqrt{2}\times10^8\cos 45^\circ\ \text{m s}^{-1} = 10^8\ \text{m s}^{-1}$ 


At time t the x-coordinate of the rocket is v, t. 



At t = 1 s, x = 10⁸ m 
At t = 2 s, x = 2 × 10⁸ m 
At t = 3 s, x = 3 × 10⁸ m 


(b) From the Lorentz transformation (equation 16) it follows that 
the first flash occurs at 


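$$x' = \frac{10^8 - [(3/5)\times 3\times10^8\times 1]}{\sqrt{1-(3/5)^2}}\,\text{m} = \frac{10^8 - 1.8\times10^8}{\sqrt{16/25}}\,\text{m} = -10^8\,\text{m}$$

and 

$$t' = \frac{1 - [(3/5)\times 3\times10^8\times 10^8/(9\times10^{16})]}{\sqrt{1-(3/5)^2}}\,\text{s} = \frac{1-(1/5)}{4/5}\,\text{s} = 1\,\text{s}$$

The second flash is at 

$$x' = \frac{2\times10^8 - (9\times10^8\times 2/5)}{4/5}\,\text{m} = -2\times10^8\,\text{m}$$

$$t' = \frac{2 - [(18\times10^{16})/(45\times10^{16})]}{4/5}\,\text{s} = 2\,\text{s}$$

The third flash is at 

$$x' = \frac{3\times10^8 - (9\times10^8\times 3/5)}{4/5}\,\text{m} = -3\times10^8\,\text{m}$$

$$t' = \frac{3 - [(27\times10^{16})/(45\times10^{16})]}{4/5}\,\text{s} = 3\,\text{s}$$
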


It is interesting, but purely accidental, that the times of the events 
are the same in both frames. SAQ 12 shows that the time coordinate 
t of an event in frame A is not necessarily the same as the time 
coordinate t' of that same event in a different frame B. 


SAQ 14 The Lorentz transformation is: 


$$x' = \frac{x - Vt}{\sqrt{1-(V^2/c^2)}} \qquad (49)$$

$$y' = y \qquad (50)$$

$$z' = z \qquad (51)$$

$$t' = \frac{t - (xV/c^2)}{\sqrt{1-(V^2/c^2)}} \qquad (52)$$

Using these equations we must find an expression for x and t in 
terms of x' and t'. From equation 52, $t = t'\sqrt{1-(V^2/c^2)} + (xV/c^2)$. 
Substituting this into equation 49 gives 

$$x' = \frac{x - Vt'\sqrt{1-(V^2/c^2)} - (xV^2/c^2)}{\sqrt{1-(V^2/c^2)}} = \frac{x[1-(V^2/c^2)]}{\sqrt{1-(V^2/c^2)}} - Vt' = x\sqrt{1-(V^2/c^2)} - Vt'$$

i.e. 

$$x = \frac{x' + Vt'}{\sqrt{1-(V^2/c^2)}}$$

as required. 

Similarly, from equation 49 

$$x = x'\sqrt{1-(V^2/c^2)} + Vt$$

Substituting this into equation 52 

$$t' = \frac{t - [x'\sqrt{1-(V^2/c^2)} + Vt]\,V/c^2}{\sqrt{1-(V^2/c^2)}} = \frac{t - Vt(V/c^2)}{\sqrt{1-(V^2/c^2)}} - x'(V/c^2)$$

i.e. 

$$\frac{t[1-(V^2/c^2)]}{\sqrt{1-(V^2/c^2)}} = t' + (x'V/c^2)$$

i.e. 

$$t = \frac{t' + (x'V/c^2)}{\sqrt{1-(V^2/c^2)}}$$

as required. 

And, of course, y = y' and z = z' as required. 


The four equations obtained above can be obtained much more 
simply from the Lorentz transformation by noting that the origin 
of frame A moves along the x'-axis of frame B. The x'-component of 
the velocity of the origin of frame A is −V. So to find the coordinate 
transformation from B to A, just interchange all the primes in 
equation 16 (i.e. interchange x and x' etc.) and replace V by —V 
wherever it occurs. 
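
The shortcut described above (interchange the primes and replace V by −V) can be tried out numerically. The sketch below is illustrative only: it assumes the x and t parts of the Lorentz transformation for standard configuration, the rounded value c = 3 × 10⁸ m s⁻¹, and hypothetical function names `lorentz` and `inverse_lorentz`. It transforms an event from A to B and back again.

```python
import math

C = 3.0e8  # rounded speed of light in m/s


def lorentz(x, t, V, c=C):
    """Frame A -> frame B for the x and t coordinates (standard configuration)."""
    gamma = 1.0 / math.sqrt(1.0 - (V / c) ** 2)
    return gamma * (x - V * t), gamma * (t - x * V / c ** 2)


def inverse_lorentz(x_prime, t_prime, V, c=C):
    """Frame B -> frame A: the same formulae with V replaced by -V."""
    return lorentz(x_prime, t_prime, -V, c)


x, t = 2.0e8, 2.0                      # an event in frame A
x_prime, t_prime = lorentz(x, t, 0.6 * C)
x_back, t_back = inverse_lorentz(x_prime, t_prime, 0.6 * C)
print(x_prime, t_prime)                # coordinates in frame B
print(x_back, t_back)                  # should reproduce (x, t) up to rounding
```

The round trip returns the original coordinates to within floating-point rounding, which is what the algebra of SAQ 14 guarantees.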


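SAQ 15 Using 

$$t' = \frac{t - (xV/c^2)}{\sqrt{1-(V^2/c^2)}}$$

(a) 

$$t'_1 = \frac{7 - [(4\times12\times10^8)/(5\times3\times10^8)]}{\sqrt{1-\frac{16}{25}}}\,\text{s} = \frac{7-3.2}{3/5}\,\text{s} = 6.33\,\text{s}$$

$$t'_2 = \frac{7 - [(4\times18\times10^8)/(5\times3\times10^8)]}{\sqrt{1-\frac{16}{25}}}\,\text{s} = \frac{7-4.8}{3/5}\,\text{s} = 3.67\,\text{s}$$
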


(b) The two events are not simultaneous according to B—they 
occur at different values of t'. 
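
A short numerical check of this loss of simultaneity is sketched below. It assumes the values that appear in the worked answer (both events at t = 7 s in frame A, at x = 12 × 10⁸ m and x = 18 × 10⁸ m, with V = 4c/5) and the rounded value c = 3 × 10⁸ m s⁻¹; the function name `t_prime` is illustrative.

```python
import math

C = 3.0e8  # rounded speed of light in m/s


def t_prime(x, t, V, c=C):
    """Time coordinate in frame B of an event at (x, t) in frame A."""
    return (t - x * V / c ** 2) / math.sqrt(1.0 - (V / c) ** 2)


V = 0.8 * C                       # V = 4c/5
t1 = t_prime(12e8, 7.0, V)        # first event
t2 = t_prime(18e8, 7.0, V)        # second event, same t in frame A
print(round(t1, 2), round(t2, 2))
print("simultaneous in B?", math.isclose(t1, t2))
```

The two events share the same t in frame A but receive different values of t' in frame B.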


SAQ 16 Because the two events happen at the same time in 
frame A, but at different positions, it is quite impossible for any 
signal moving at the speed of light or slower to travel from one 
event to the other. This means that in frame A neither of the events 
could have caused the other. But this kind of causal relationship 
between two events is independent of the frame in which the events 
are observed. So the fact that neither event could have caused the 
other in frame A means that there does not exist any frame D in 
which one event caused the other. (Of course, it is possible to find 
frames in which $e_1$ happened before $e_2$, but that does not mean that 
$e_1$ caused $e_2$ or even that $e_1$ could have caused $e_2$.) 


SAQ 17 Imagine that the frog is at the origin of frame A and the 
gnat is at the origin of frame B. The frames A and B are in standard 
configuration with V = c/2. The x-component of the velocity of the 
frog's tongue tip in frame A is $v_x = 3c/4$. The question may be 
rephrased as: ‘What is the x'-component of the velocity of the tongue 
tip in frame B?’ The answer follows from equation 28, and it is 

$$v'_x = \frac{v_x - V}{1 - (v_xV/c^2)} = \frac{c/4}{1-\frac{3}{8}} = \frac{2c}{5}$$



SAQ 18 In this case you must picture the frame B to be travelling 
towards the origin of frame A. This situation is easy to deal with if 
the V that appears in equation 28 is put equal to −c/2. (Remember, 
V is really a velocity component.) So, using 

$$v'_x = \frac{v_x - V}{1 - (v_xV/c^2)}$$

it follows that the speed of approach of the tongue, according to the 
gnat, is 

$$v'_x = \frac{5c/4}{1+\frac{3}{8}} = \frac{10}{11}c = 0.909c$$

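
SAQs 17 and 18 differ only in the sign of V, which a small function makes explicit. This is a minimal sketch, assuming the usual form of the velocity transformation for the x-component (taken here to be equation 28) and working in units where c = 1; the name `velocity_transform` is illustrative.

```python
def velocity_transform(v_x, V, c=1.0):
    """x'-component of velocity in frame B, given v_x in frame A."""
    return (v_x - V) / (1.0 - v_x * V / c ** 2)


# Speeds are expressed as fractions of c.
receding = velocity_transform(0.75, 0.5)      # SAQ 17: V = +c/2
approaching = velocity_transform(0.75, -0.5)  # SAQ 18: V = -c/2
print(receding, approaching)
```

Note that the second result stays below c even though the Galilean sum of the two speeds would be 5c/4.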


SAQ 19 When viewed from the frame in which the cosmic ray is 
at rest, the distance between the Earth and the balloon will be 
reduced due to the Lorentz contraction. The contracted length will 
be 


$$L = L_0\sqrt{1-(V^2/c^2)}$$

i.e. $L = 10^4\sqrt{1-(0.996)^2}\ \text{m}$ 

$L = 10^4\sqrt{1-0.992}\ \text{m}$ 

so L = 894 m 


SAQ 20 According to an observer on the Earth, light takes 4.2 
years to travel from the Earth to Alpha Centauri. So an astronaut 
travelling at 0.9c would require 


$$T = \frac{4.2}{0.9}\,\text{years} = 4.7\,\text{years}$$


to complete the journey according to an observer on Earth. However, 
the astronaut is moving relative to the observer on Earth, so 
the astronaut's clock is running slow compared with clocks on 
Earth. The interval of time that had elapsed in the astronaut's frame 
would be 


$$T' = T\sqrt{1-\frac{V^2}{c^2}} = 4.7\sqrt{1-(0.9)^2}\ \text{years}$$

i.e. T' = 2.0 years 


You can also obtain this answer by viewing things from the 
astronaut's point of view. In his frame the star is moving at 0.9c, and 
the distance between the Earth and the star is Lorentz-contracted. 
You may have had some difficulty deciding whether the journey 
time according to the observer on Earth was T or T' in equation 34. 
The easiest way of making the right choice is simply to think about 
the question and decide which interval of time will be shorter. The 
shorter interval is called T' in the equation. 
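
The two routes to the answer mentioned above (time dilation in the Earth frame, Lorentz contraction in the astronaut's frame) can be compared numerically. The sketch below is illustrative, with hypothetical function names; it works in years and light-years so that c = 1, and takes the 4.2 light-year distance and the speed 0.9c from the answer.

```python
import math


def dilated_time(T, beta):
    """Time elapsed on a clock moving at speed beta*c while T elapses in the Earth frame."""
    return T * math.sqrt(1.0 - beta ** 2)


def contracted_length(L0, beta):
    """Length of a distance L0 as measured in a frame moving at speed beta*c."""
    return L0 * math.sqrt(1.0 - beta ** 2)


beta = 0.9
distance_ly = 4.2                       # Earth to Alpha Centauri, Earth frame
T_earth = distance_ly / beta            # journey time according to Earth, in years

# Route 1: the astronaut's clock runs slow according to Earth.
T_astronaut = dilated_time(T_earth, beta)

# Route 2: in the astronaut's frame the distance is contracted.
T_astronaut_alt = contracted_length(distance_ly, beta) / beta

print(round(T_earth, 1), round(T_astronaut, 1), round(T_astronaut_alt, 1))
```

Both routes give the same elapsed time for the astronaut, as they must.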


SAQ 21 (a) The return leg of the round trip takes just as long 
as the outward leg. So according to the astronaut the round trip 
takes 4 years. 


(b) According to the twin who stayed on Earth, the round trip 
took 9.4 years. This answer deserves a lengthy comment. The 
difference in the round-trip times means that when the astronaut 
returns to the Earth his twin brother is more than five years older 
than himself! This is an interesting effect, but not too surprising 
once you have come to terms with time dilation. However, the 
difference in the ages of the twins is often discussed in the context 
of the so-called ‘twin paradox’. The twin paradox can be stated in the 
following way. 


From the point of view of the twin who remained on Earth, it is 
not surprising that the astronaut has aged less. The astronaut has 
been travelling at high speed, so time has passed more slowly in his 
frame. However, is it not equally true that from the point of view of 
the astronaut it is the twin on Earth who has been travelling at high 
speed? And if so, should it not be the case that it is the twin on Earth 
who should be the younger? 


This so-called paradox arises because it is being claimed (incorrectly) 
that either twin can equally well be regarded as the ‘moving’ twin 
because all speed is relative. It is certainly true that all speed is 
relative, but even so the two twins cannot be treated equally. The 
difference between them arises from the fact that it is the astronaut 
who had to turn around and reverse his direction of travel. This 
process involves deceleration and acceleration which, unlike speed, 
are not relative quantities. If you are in a completely closed box, 
there is no way of determining your speed, but you can certainly 
tell whether or not the box is accelerating. Acceleration can be ‘felt’. 
Only the astronaut feels the acceleration; the twin on Earth does 
not. Thus the two twins have different experiences; the twin on 
Earth has spent all his time in an (approximately) inertial frame, 
while the astronaut has spent part of his time in an accelerated 
(non-inertial) frame. Given this asymmetry, there is no reason why 
the twins should have aged equally, and so there is no paradox. The 
astronaut really does age less than his Earth-bound twin. 


SAQ 22 According to the formula given in the TV Notes, if we 
start with $N_0$ particles, with half life $\tau$, then the number that will 
remain after time t is given by 

$$N = N_0\,2^{-t/\tau}$$

If 10⁶ neutrons are created at t = 0, the number remaining after 
t = 510 s (= 8½ minutes) is 

$$N = 10^6\times2^{-510/648}$$

$$N = 10^6\times2^{-0.787}$$

so $N = 0.58\times10^6$ 

(If your calculator does not allow you to work out $2^{-0.787}$ directly, 
you can accomplish the task by using logs. Remember, $2 = 10^{\log 2}$, 
so $2^{-0.787} = (10^{\log 2})^{-0.787} = 10^{-0.787\log 2}$. In other words, 
$2^{-0.787} = \text{antilog}(-0.787\times\log 2) = 0.58$.) 


Note: the neutron’s half life is only known to within ±25 s, so the 
value of 648 s is merely an approximate value. 


SAQ 23 According to an observer on Earth, a clock in the rest 
frame of the neutron would be running slow—due to time dilation. 
So according to such an observer the time that had elapsed in the 
rest frame of the neutron would be 


$$T' = 510\sqrt{1-(0.95)^2}\ \text{s} = 510\sqrt{9.75\times10^{-2}}\ \text{s} = 159\ \text{s}$$


(You can also obtain this result by viewing things from the frame 
in which the neutron is at rest. In that frame the Earth moves with 
speed 0.95c and the distance between the Sun and the Earth is 
Lorentz-contracted.) 


So the number of neutrons that reach Earth will be about 
$$N = 10^6\times2^{-159/648} = 10^6\times2^{-0.245} = 0.84\times10^6$$


This number is considerably larger than the number calculated 
in SAQ 22, where it was assumed that the neutrons were at rest. The 
enhanced lifetime (according to an observer on Earth) is a 
consequence of time dilation. 
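
The comparison between SAQ 22 and SAQ 23 can be reproduced in a few lines. This is a sketch under the assumptions stated in the answers (half life 648 s, journey time 510 s in the Earth frame, neutron speed 0.95c, 10⁶ neutrons initially); the names `remaining`, `HALF_LIFE` and so on are illustrative.

```python
import math


def remaining(n0, t, half_life):
    """Number of particles left after time t, given the half life."""
    return n0 * 2.0 ** (-t / half_life)


HALF_LIFE = 648.0     # neutron half life in s (approximate value used in SAQ 22)
N0 = 1.0e6            # neutrons created at t = 0
T_EARTH = 510.0       # journey time in the Earth frame, in s
BETA = 0.95           # neutron speed as a fraction of c

# SAQ 22: decay of neutrons at rest over 510 s.
n_rest = remaining(N0, T_EARTH, HALF_LIFE)

# SAQ 23: only the time elapsed in the neutron rest frame counts for the decay.
t_neutron = T_EARTH * math.sqrt(1.0 - BETA ** 2)
n_moving = remaining(N0, t_neutron, HALF_LIFE)

print(round(n_rest), round(t_neutron), round(n_moving))
```

The exponent can equally be evaluated with logarithms, as described in SAQ 22: `10 ** (-0.787 * math.log10(2))` gives the same factor of about 0.58.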


SAQ 24 (a) Using equation 35, the magnitude of the initial 
relativistic momentum is 

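$$p_i = \frac{mv}{\sqrt{1-(v^2/c^2)}} = \frac{5\times2\times10^8}{\sqrt{1-(2/3)^2}}\ \text{kg m s}^{-1} = 1.34\times10^9\ \text{kg m s}^{-1}$$

(b) As the force is constant, the momentum changes at a constant 
rate. The change in the magnitude is given by 

$$\Delta p = F\times\Delta t = 10^6\,\text{N}\times10^3\,\text{s} = 10^9\ \text{kg m s}^{-1}$$

So the final magnitude of the relativistic momentum is 

$$p_f = 1.34\times10^9 + 10^9 = 2.34\times10^9\ \text{kg m s}^{-1}$$

(c) But from equation 35 it follows that the final speed $v_f$ is given 
by 

$$p_f = \frac{mv_f}{\sqrt{1-(v_f^2/c^2)}}$$

i.e. 

$$p_f^2\left(1-\frac{v_f^2}{c^2}\right) = m^2v_f^2$$

i.e. 

$$v_f^2\left(m^2+\frac{p_f^2}{c^2}\right) = p_f^2$$

i.e. 

$$v_f = \frac{p_f}{\sqrt{m^2+(p_f^2/c^2)}}$$

So 

$$v_f = \frac{2.34\times10^9}{\sqrt{5^2+\left(\dfrac{2.34\times10^9}{3\times10^8}\right)^2}}\ \text{m s}^{-1} = 2.53\times10^8\ \text{m s}^{-1}$$
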
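
The three steps of SAQ 24 can be followed numerically. The sketch below assumes the values used in the answer (m = 5 kg, initial speed 2 × 10⁸ m s⁻¹, a force of 10⁶ N acting for 10³ s) and the rounded value c = 3 × 10⁸ m s⁻¹; the function names are illustrative.

```python
import math

C = 3.0e8  # rounded speed of light in m/s


def momentum(m, v, c=C):
    """Magnitude of relativistic momentum for mass m and speed v."""
    return m * v / math.sqrt(1.0 - (v / c) ** 2)


def speed_from_momentum(m, p, c=C):
    """Invert the momentum formula: v = p / sqrt(m^2 + p^2/c^2)."""
    return p / math.sqrt(m ** 2 + (p / c) ** 2)


m, v_i = 5.0, 2.0e8          # kg, m/s
force, duration = 1.0e6, 1.0e3  # N, s

p_i = momentum(m, v_i)               # (a) initial momentum
p_f = p_i + force * duration         # (b) constant force: delta p = F * delta t
v_f = speed_from_momentum(m, p_f)    # (c) final speed

print(f"{p_i:.3g} {p_f:.3g} {v_f:.3g}")
```

However large the momentum becomes, the speed returned by `speed_from_momentum` remains below c.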



SAQ 25 (a) The rest-mass energy of the proton is 
$m_pc^2 = (1.67\times10^{-27})\times(9\times10^{16})\ \text{J} = 1.50\times10^{-10}\ \text{J}$. 


(b) If the relativistic kinetic energy of the proton is equal to its 
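rest-mass energy, then the total relativistic energy $E = 2m_pc^2$. 
Using equation 41, we thus obtain 

$$E = 2m_pc^2 = \frac{m_pc^2}{\sqrt{1-(v^2/c^2)}}$$

i.e. 

$$1-\frac{v^2}{c^2} = \frac{1}{4}$$

i.e. 

$$\frac{v^2}{c^2} = \frac{3}{4}$$

So 

$$v = \frac{\sqrt{3}}{2}\,c = 2.6\times10^8\ \text{m s}^{-1}$$
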


(c) If the relativistic kinetic energy is three times as great as the 
rest-mass energy, the total relativistic energy is 


$$E = 3m_pc^2 + m_pc^2 = 4m_pc^2 = 6.0\times10^{-10}\ \text{J}$$

(d) According to equation 41, the energy of a proton with speed 


v = c/2 is 


$$E = \frac{m_pc^2}{\sqrt{1-\frac{1}{4}}} = \frac{2}{\sqrt{3}}\,m_pc^2 \approx 1.7\times10^{-10}\ \text{J}$$


SAQ 26 The magnitude p of the relativistic momentum of the 
particle is 


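$$p = (p_x^2+p_y^2+p_z^2)^{1/2} = 5\ \text{kg m s}^{-1}$$

(a) Using equation 47 with $m = (5/3)\times10^{-8}$ kg gives 

$$E^2 = 25\times(3\times10^8)^2 + \left(\tfrac{5}{3}\times10^{-8}\right)^2\times(3\times10^8)^4\ \text{J}^2$$

$$E^2 = 25\times(3\times10^8)^2 + 25\times(3\times10^8)^2\ \text{J}^2$$

i.e. $E^2 = 50\times(3\times10^8)^2\ \text{J}^2$ 

So $E = 2.12\times10^9$ J 

(b) Using equation 47 again, but with m = 0 gives 

$$E^2 = 25\times(3\times10^8)^2\ \text{J}^2$$

i.e. $E = 5\times3\times10^8\ \text{J} = 1.5\times10^9$ J 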
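
Both parts of SAQ 26 follow from a single relation between energy, momentum and mass. The sketch below assumes that equation 47 has the standard form E² = p²c² + m²c⁴, takes the momentum magnitude 5 kg m s⁻¹ and the mass (5/3) × 10⁻⁸ kg from the answer, and uses the rounded value c = 3 × 10⁸ m s⁻¹; the function name `total_energy` is illustrative.

```python
import math

C = 3.0e8  # rounded speed of light in m/s


def total_energy(p, m, c=C):
    """Total relativistic energy from E^2 = p^2 c^2 + m^2 c^4."""
    return math.sqrt((p * c) ** 2 + (m * c ** 2) ** 2)


p = 5.0                                          # kg m/s
E_massive = total_energy(p, (5.0 / 3.0) * 1e-8)  # part (a)
E_massless = total_energy(p, 0.0)                # part (b): E = pc

print(f"{E_massive:.3g} J, {E_massless:.3g} J")
```

With m = 0 the relation reduces to E = pc, the case that applies to photons.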


SAQ 27 The photons must travel in opposite directions in the 
rest frame of the π⁰ meson due to (relativistic) momentum 
conservation. The stationary π⁰ meson has no momentum, so the total 
momentum of the two photons must be zero. That means they 
must travel in opposite directions, each having a momentum that is 
equal in magnitude but opposite in direction to that of the other. 
Because the magnitude of the momentum of each photon is the 
same, it follows from equation 48 that the energy of each photon is 
the same. As the two photons must carry equal energy, and as their 
total energy must be equal to the rest-mass energy of the π⁰ 
meson, it follows that each photon carries half the rest-mass energy 
of the π⁰ meson. 


SAQ 28 It follows from the conservation of relativistic momentum 
that, in the rest frame of the π⁰ meson, the electron and the positron 
must travel in opposite directions, each moving at the same speed 
v. (The argument is the same as that used in SAQ 27, but now the 
two particles produced by the decay each have the same non-zero 
mass.) For the electron, $E = \tfrac{1}{2}m_\pi c^2$ 

So 

$$\frac{m_ec^2}{\sqrt{1-(v^2/c^2)}} = \frac{1}{2}m_\pi c^2$$

i.e. 

$$\sqrt{1-(v^2/c^2)} = \frac{2m_e}{m_\pi}$$

$$1-\frac{v^2}{c^2} = \frac{4m_e^2}{m_\pi^2}$$

$$\frac{v^2}{c^2} = 1-\frac{4m_e^2}{m_\pi^2}$$

So 

$$v = \sqrt{1-\frac{4m_e^2}{m_\pi^2}}\;c$$

i.e. 

$$v = \sqrt{1-\frac{4(9.109\times10^{-31})^2}{(2.40\times10^{-28})^2}}\;c$$

$$v = \sqrt{1-5.762\times10^{-5}}\;c$$

Thus v = 0.99997c 
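
Because the result is so close to c, it is worth checking the last step numerically. The sketch below assumes the π⁰ mass of 2.40 × 10⁻²⁸ kg used in the answer and an electron mass of 9.109 × 10⁻³¹ kg (the commonly quoted value, taken here as an assumption); the function name `decay_speed_over_c` is illustrative.

```python
import math

M_E = 9.109e-31    # electron mass in kg (commonly quoted value; an assumption here)
M_PI0 = 2.40e-28   # neutral pion mass in kg, as used in the answer


def decay_speed_over_c(m_parent, m_child):
    """Speed (as a fraction of c) of each of two equal-mass decay products
    when a parent at rest shares its rest-mass energy equally between them."""
    return math.sqrt(1.0 - (2.0 * m_child / m_parent) ** 2)


beta = decay_speed_over_c(M_PI0, M_E)
print(round(beta, 5))
```

The same function gives a speed of zero when the parent mass is exactly twice the mass of each product, since no energy is then left over for motion.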
Acknowledgement 


Grateful acknowledgement is made to the Hulton Picture Library for permission 
to reproduce the Frontispiece. 



Fold-out page: standard configuration 
and the Lorentz transformation 


[Diagram: STANDARD CONFIGURATION, showing frame A and frame B; the origins coincide at t = t' = 0] 


Two inertial frames, A and B, are in standard configuration when: 


1 The origin of frame B moves with constant speed V, in the positive direction 
along the x-axis of frame A. 


2 The two frames are oriented in the manner shown in the diagram. 


3 The origins of the two frames coincide at t = t' = 0. 


The central problem of relativity is to find the coordinate transformation that links 
two inertial frames, A and B, in standard configuration. 


The solution to the central problem of relativity is the Lorentz transformation: 


$$x' = \frac{x - Vt}{\sqrt{1-(V^2/c^2)}}$$

$$y' = y$$

$$z' = z$$

$$t' = \frac{t - (xV/c^2)}{\sqrt{1-(V^2/c^2)}}$$


